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ABSTRACT 

In preparation for the HERMES chemical tagging survey of about a million Galactic 
FGK stars, we estimate the number of independent dimensions of the space defined 
by the stellar chemical element abundances [X/Fe]. This leads to a way to study the 
origin of elements from observed chemical abundances using Principal Component 
Analysis. We explore abundances in several environments, including solar neighbour- 
hood thin/thick disk stars, halo metal-poor stars, globular clusters, open clusters, the 
Large Magellanic Cloud and the Fornax dwarf spheroidal galaxy. By studying solar- 
neighbourhood stars, we confirm the universality of the r-process that tends to produce 
[n-capture elements/Fe] in a constant ratio. We find that, especially at low metallic- 
ity, the production of r-process elements is likely to be associated with the production 
of a-elements. This may support the core-collapse supernovae as the 7'-process site. 
We also verify the over-abundances of light s-process elements at low metallicity, and 
find that the relative contribution decreases at higher metallicity, which suggests that 
this lighter elements primary process may be associated with massive stars. We also 
verify the contribution from the s-process in low-mass AGB stars at high metallic- 
ity. Our analysis reveals two types of core-collapse supernovae: one produces mainly 
a-elements, the other produces both a-elements and Fe-peak elements with a large 
enhancement of heavy Fe-peak elements which may be the contribution from hyper- 
novae. Excluding light elements that may be subject to internal mixing, K and Cu, we 
find that the [X/Fe] chemical abundance space in the solar neighbourhood has about 
6 independent dimensions both at low metallicity (—3.5 < [Fe/H] < —2) and high 
metallicity ([Fe/H] > — 1). However the dimensions come from very different origins 
in these two cases. The extra contribution from low mass AGB stars at high metal- 
licity compensates the dimension loss due to the homogenization of the core-collapse 
supernovae ejecta. Including the extra dimensions from [Fe/H], K, Cu and the light 
elements, the number of independent dimensions of the [X/Fe] -I- [Fe/H] chemical space 
in the solar neighbourhood for HERMES is about 8 to 9. Comparing fainter galax- 
ies and the solar neighbourhood, we find that the chemical space for fainter galaxies 
such as Fornax and the Large Magellanic Cloud has a higher dimensionality. This 
is consistent with the slower star formation history of fainter galaxies. We find that 
open clusters have more chemical space dimensions than the nearby metal-rich field 
stars. This suggests that a survey of stars in a larger Galactic volume than the solar 
neighbourhood may show about 1 more dimension in its chemical abundance space. 

Key words: methods: data analysis - ISM: abundances - ISM: evolution - stars: 
abundances - stars: AGB and post-AGB - stars: supernovae: general. 
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loss from stellar evolution, two-body effects and the 
tidal field of the Galaxy (e.g. IChernoff fc Weinberg! Il990l : 
lOdenkirchen et alJ 12003 ). After the ag gregates dissolve, 
their debris disperses and after several Galactic rotation pe- 
riods becomes mixed throughout an annular region around 
the Galaxy. In this way, the stellar disk was gradually built 
up. The goal of chemical tagging is to use element abun- 
dances to recons truct these ancient clusters in wh ich the 
stars were born (jFreeman fc Bland-HawthornI l2002[ l . Indi- 
vidual clusters are observed to be chemically homogeneous , 
at least in the elements heavier th an Na (e.g. iDe Silva et al.l 
I2OO6I : iMikolaitis et al.ll2010l . I2OIII ). Stars that were born in 
the same cluster and have now dispersed will have similar el- 
ement abundance patterns reflecting the chemical evolution 
of the gas from which they formed. This gas had its own 
history of pollution by ejecta from core-collapse supernovae, 
type la supernovae (SNe la) and Asymptotic Giant Branch 
(AGB) stars. 

Chemical space (C-space) is a space defined by the abun- 
dances of the chemical elements that we are able to measure. 
Finding the debris from long-dispersed clusters using chem- 
ical tagging is an exercise in group finding in this multidi- 
mensional C-space. For stars which formed in clusters within 
the Milky Way (MW), chemical tagging appears relatively 
straightforward. The stars were born in chemically homoge- 
neous clusters and the detailed abundance pattern changes 
from cluster to cluster. The debris from a single star clus- 
ter will be tightly clustered in chemical space. Stars can 
also come into the Galaxy in small accreted galaxies; their 
stars are again disrupted by the Galactic tidal field and dis- 
persed. If these small galaxies are like the dwarf spheroidal 
galaxies (dSph) and ultra-faint galaxies now around the 
Milky Way, they would not ha ve been be chemically ho- 
mogeneous ijTolstov et al.ll2009l ). Their broad and different 
element abundance patterns are defined by their individual 
star formation histories (SFH) and are unlike those of the 
Galactic disk. The debris of such accreted galaxies would not 
lie in a tight clump but in a streak through C-space which 
refiects the element abundance pattern in the parent dwarf. 

To make a significant recovery of dispersed aggregates, 
simulations show t hat a large sample of stars, of ord er one 
million, is needed (iBland-Hawthorn fc Freemanlliooi ). The 
galaeQ survey using the HERME^ instrument on the 
Anglo Australian Telescope is designed to do such a chem- 
ical tagging study. It will measure abundances of about 25 
elements using multi-object high resolution (R = 28,000) 
spectroscopy of about a million stars. Its C-space will have 
about 25 dimensions but the abundances of these 25 ele- 
ments do not all vary independently. The abundances of 
some elements are correlated because of the underlying nu- 
cleosynthetic processes, and the effective dimensionality of 
the C-space will be less than 25. Based on the existence of 
the various element groups (light, light odd-Z, a, Fe-peak, 
light and heavy s-process and mostly r-process) represented 
in the HERMES spectra, and from various abundance pat- 
terns observed in field and cluster stars, we think that the di- 
mensionality of the C-space will be about 7 to 9. The higher 
the dimensionality of C-space, the more power chemical tag- 
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ging will have to identify the debris of disrupted systems. 
We would like to determine the dimensionality of C-space 
more rigorously and that is one of the main purposes of this 
paper. 

It seems likely that the dimensionality of C-space will 
depend on the metallicitjjf] of the stars defining the C- 
space. For example, stars of low metallicity may have formed 
within a brief period of time and the gas from which they 
formed may have been enrich ed by only a few supernovae 
events (|Audouze fc Sillj|l995l ). Among the neutron-capture 
(n-capture) elements, there would not have been time for 
AGB stars to provide enrichment of s-process elements, so 
the r-process may dominate the chemical evolution. On the 
other hand, the more metal-rich stars may have had a long 
history of chemical evolution with more than one process 
contributing to the n-capture element abundances. 

We will use Principal Component Analysis (PCA) to 
determine the dimensionality of C-space for various samples 
of stars, including field stars in different metallicity intervals, 
star clusters and stars in the Fornax dSph galaxy and the 
Large Magellanic Cloud (LMC). This is because the com- 
ponents will be different in the different situations. We will 
attempt to interpret the principal components in terms re- 
lated to nucleosynthesis processes and hopefully gain some 
insight into the dominant processes in each situation. 

This paper is organized as follows: Section[2]gives a brief 
discussion of some related aspects of chemical evolution pro- 
cesses and sites. In Section O we give details on the samples 
that we chose to study. In Section |4l we will describe the 
PCA method and a way to estimate intrinsic correlation us- 
ing Monte-Carlo simulations. In Section (5] we will present 
the results of PCA analysis and then in Section [6] we will 
interpret the results and discuss their origins. We summarize 
our main conclusions in Section [T] 



2 CHEMICAL EVOLUTION PROCESSES 

Stellar element abundance studies are usually presented in 
terms of [X/Fe]-[Fe/HQ diagrams. These illustrate many ba- 
sic element behaviour patterns, such as decreasing trend of 
[a/Fe] toward higher [Fe/H] which is usually attributed to 
the increasing contribution of SNe la to the chemical evolu- 
tion at later times. These two-dimensional plots do not read- 
ily reveal the interplay between different families of elements 
during the different stages of chemical evolution. This inter- 
play is of particular interest to us in trying to determine the 
dimensionality of C-space. As chemical evolution proceeds, 
different groups of elements evolve together and define indi- 
vidual dimensions of the C-space. We seek an approach that 
is able to display these groups of elements, using PCA that 
we will present here. This will hopefully benefit the analy- 
sis of elemental abundances that will be obtained soon for 
large samples of stars with high resolution multi-object sur- 
veys such as APOGE M^KWende Prieto et al..,200a '). HER- 

We assume that the iron abundance [Fe/H] is a pertinent tracer 
of the metallicity in this paper. 

4 By definition, [X/Y] = logio(Nx/Ny)star " logio(Nx/Ny)Q, 
where Nx and Ny are the abundances of element X and element 
Y respectively. 

^ The Apache Point Observatory Galactic Evolution Experiment 
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MES and the ESO-Gaia VLT survey. First we briefly discuss 
major production sites for different families of elements. 

a-elements such as O, Ne, Mg, Si, S, and Ca are 
mainly produced by core-collapse supernovae (i.e. Type 
II, lb and Ic Supernovae), while Fe-peak elements such 
as Cr, Mn, Ni, and Fe are mainly produced by SNe 
la. Although the progenitor models are still debated, 
the onset delay for these SNe is in a range of 0.1 
- 2 Gyr after star form ation begins (see model from 
iKobavashi fc NomotdlioogI ) . Observations show complica- 
tions such as over-abundances of heavy Fc-pcak eleme nts 
such as Zn and Co at low metal licity (McWilliam c t al. 
199E : iRvan. Norris fc Beerd Il996l: [korris. Ryan fc Beers 



ble to the /3-decay rate. 



The major sites for the s- 
low-mass (1.5 - 3 Mr) AGB 



200ll : ICavrel et al.ll2004l : IChen. Nissen fc Zhaoll2004l ). These 
cannot be understood from core-collapse supernovae models 
with explosion energy of about 10^^ erg. lUmeda fc Nomotg 
l(2002 . 2005 ) showed that more energetic supernovae (hy- 
pernovae) are responsible for t he over-abundance s of Z n 
and Co. This is suppo rted by IChiefli fc LimongH l|2002h . 
iHeger fc WooslevI l|2010l ) argued that over-abundances of Zn 
can be explained without involving hypernovae. However, 
llzutani fc Umedal ^010) showed that it is necessary to in- 
clude hypernovae in order to explain both Co and Zn obser- 
vations. 

Neutron-capture elements {A > 65) can provide 
independent probes for Galactic chemical evolution. There 
is not yet a consistent scenario to explain all observations 
of n-capture elements. Physically, the n-capture processes 
are divided into slow (s— ) and rapid (r—) processes. In 
the r-process the neutron flux is intense and the time-scale 
of n-capture is much shorter than that of the /3-decay. 
This causes the seed nucleus to grab many neutrons 
before it /3-decays to the valley of stability. The observed 
abundance patterns of r-process elements show very small 
star-to-star variation, and are in excellent agreement with 
the scaled solar r -process curve, at least for elements 



56 < Z < 72 ( e g. Isiieden et al.l[l99^: IWestin eTsU I2OO0I : 
iHill et all I2OO2I : ICowan et al] I200I I2OO5I ). This suggests 
the universality of the r-process nucleosynthesis. How- 
ever, the site(s) of the r-process remain uncertain. The 
requirements on the physical conditions are neutron rich 
(i.e. low electron fra ction Ye), high entropy and short 
dynamic tim e-scales JWanaio fc Ishimajul I2OO6I ): but see 
caveat from iFreiburghaus et al.l (|l999al ). A possible site 
is the neutrino-driven neutron-wind during the forma- 
tion of neutron st ars in core-collapse supernovae (e.g. 
IWooslev et al]|l994l '). Various scenarios of this possibility 
have been stud ied, including association with low-mass 
supernova e (e.g. Wanaio ct al. 2009) or massive supernovae 
l|Truran et al..,2002. : Wanaio fc IshimarUi2006 | ). such as the 
^/-dri ven He-shell mechanism ( Baneriee. Haxton fc Qianl 
120111 1. Another alternativ e site is neutron star merg- 
ers IF reiburghaus, Ro sswog fc Th iclcmannl Il999bl : 
iRosswog et al.l 1 19991 ') . It has been argued that the strin- 
gent mass range might be responsible for the large 
scatter that has been ob served for n-capture elements 
l|Wanaio fc Ishimarul I2OO6I ). The homogenizing effect of 
n-capture element evolution is less apparent than for the 
Q-elements that are produced by the whole mass range of 
core-collapse supernovae. 

In contrast, for the s-process, the neutron flux is 
not so intense and the n-capture rate is compara- 



stars (for a review, see |Busso. Gal 


ino & Wasserburalll999l: 


iHerwid I2OO5 


: KaDDeler et al. 


2011 


) with the input of the 


"C pocket ( 


Iben fc Renzinill 


1982|; 


HoUowell fc Ibenll 19881"). 



However, the progenitor mass range, metallicity depen- 
dency and the impact of rotation l|Langer et al.l 1 19991 : 
iMevnet fc Maedeij I2OO2I ) are still uncertain. In the s- 
process, the maj or proposed neutron sources are from 
^C(a , n)^^0 fe.g.lKappeler et a"l]|l990l : IStancliffe fc Jeffervl 



I2OO7I : ICristallo et al.l |2009| ). " More massive stars (> 
4 M0) achieve higher temperatures and another neu- 
tron source ^^Ne(a, n )^''Mg could be dominant l|lbenlll975l : 
iTruran fc IbenI 1 19771 ). Stars in the mass ra nge of 8 - 
12 Mp mi ght evolve into super AGB phase f Siesd l2006l . 
I2007l . l2010l '). However, the s-process in super AGB phase 
is still yet to be understood. S-process also operates in 
massive stars that do not go through AGB/super AGB 
phase. This weak s- process can produce elements signifi- 
cantly up to Sr (e.g. jPrantzos. Hashimoto fc Nomotdll990l : 
IPignatari et al.ll2010l ). Due to the long time-scales of low- 
mass AGB stars, it is believed that AGB stars will not 
contribute significan t ly below metallicity [Fe /H] = —2 (e.g. 
iRoederer et al.ll2010l : [Kobavashi et al.ll201ld '). However, the 
contribution of AGB stars can appear even at low metallic- 
ity in the form of some classes of carbon enhanced metal- 
poor (CEMP) stars through binary mass transfer (|Mc 
19841: I Johnson fc Bolt j l2002bl. l2004l: iLucatello et al.l | 200a 



Sivarani et al.ll2004l: iGoswami et aLlbOOfil : lAoki et all 1200 . 
2008; Ma sseron et al.ll20ld '). The AGB star ejecta is trans- 
fered to the companion star and causes them to be en- 
rich ed in carbon and s-pro c ess elements (s e e models from 
Stancliffe fc Glebbeekl I2OO8I : IStancliffd l2009l : iBisterzo et aP 
I2OO9I ). In addition to n-capture element production from the 
s-process, low-mass star s also produce a small amount of Mg 
and O in the dredge-up (jMarigdl200ll : iKarakas fc Lattanzid 
l2003h . and might pollute the interstellar medium (ISM) with 
light er elements due t o the mass loss from the ou ter envelope 
(e.g. iReimerd Il977l : IVassiUadis fc WoodI Il993l ). However, 
these contributions t o the ISM are marginal, especially a t 
high metallicity (e.g. IKobavashi et ak I l200d : iKaraka^bOld V 
For the elements at A ~ 90 includi ng Sr, Y and Zr, the 
origin is even rnore problematic (e. g. iHonda et akl l2004bl : 
lAoki et akl l2005l : ICowan et akll2005l '). The observed abun- 
dance ratios of n-captu re elements suggest th e overproduc- 
tion of these elements. iTravaglio et al. (|2004l ) showed that 
this overproduction cannot be explained by the weak s- 
process alone and named it as the Lighter Element Pri- 
mary Process (LEPP). It has been shown that these ele- 
ments could be pro duced by collapse of rotating mass ive 
stars, i.e. coUapsars (|Pruet. Thompson fc Hoffmadl2004l ) or 
the weak r-process with slightly low Ye matt er that is nat- 
urally expected in core-coflapse supern ovae jlzutani et al.1 
l2009( i. iQian fc Wasserburd (|2007l . I2OO8I ') pointed out that 
these elements might be forme d through charged particle re- 
action (CPR). More recentlv. [Sovd et al.1 (|201ll ') suggested 
that this anomaly could be explained by truncated r-process 
for massive stars, in which the produced heavier r-process 
elements are consumed by the collapse of neutron star to a 
black hole. 

Light elements (Li, C, N, O, Na) will be de- 
pleted or internally-mixed during their Red Giant Branch 
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(RGB)/A GB phase either from t he CNO cycle, the NeNa 
cycle (e.g. lAndrievskv et al. 2007) or the ho t bottom burn- 
ing process (e.g. iKarakas fc Latt anzio* 2003). For instance, 
Li in giants will be depleted |Norris ct al. 1997; Spite et al. 
I2OO5I : iBonifacio et al.ll2007l : ISbordone et"llll201ol ) . whereas 
N will be enhanced at the expense of C and O. To avoid 
complication due to internal mixing and to better study the 
evolutionary state of the ISM when the stars were formed, 
we will leave the discussion of Li, C, N, O, and Na to a later 
paper. 

We will also study the C-space properties for some dwarf 
galaxies, because dwarf galaxies are often proposed as build- 
ing blocks for larger galaxies like the Milky Way (for a gen - 
eral review on local group dwarf galaxies, see lMatedll998l ). 
The chemical properties of the dwarf galaxies that are or- 
biting the Milky Way, such as Sculptor, Fornax, Sagittar- 
ius, Sextans and the LMC0 (Shctronc, Cote & Sargent 2001 
Shetrone et al.l 



_ , .20031 : Gcislcr ct^ l2005l : iSbordone et all 

2007^ reveal that the present-day dwarfs are chemically too 



dissimil ar to the Milky Way to be rea listic building blocks 
(but see lFrebel. Kirbv fc Simo"nll2010bl ). However the ultra- 
faint dSph galajcies such as Coma Berenices, Hercules, Leo 
IV, Leo T, Ursa Major I and II have abundance patterns 
more like the pattern of the Galactic metal-poor halo and ap- 
pear t o be plausible bui l ding blocks for the h alo l|Kirbv et al.l 
l2008l : lKoch et al.ll2008l : iFrebel et al.ll2010al ). ahhough their 
baryonic masses are very small. 

We briefly summarize the difference in abundance pat- 
terns between the brighter dwarf gala xies and the Milky 
Way; for a rnore complete discussion, seelVenn et all (|2004 ); 
IVenn fc Hil] l|2008l );|Tolstov. HiU fc 'IbsH ((2009'). Some dwarf 
galaxies show lower [a/Fc] at —2 < [Fe/H] < —1 than 
the metal-poor stars in the solar neighbourhood, but reach 
the same metal-poor plateau ([a/Fe] = 0.3 - 0.5) at 
[Fe/H] < —2. This is often explained by the slower star 
formation histories (SFH) of the dwarf galaxies, so SNe 
la and AGB stars that have longer time-scales can con- 
tribute more to the chemical evolution at lower metallic- 
ity than they do for the nearby metal-poor stars. This pro- 
duces a 'knee' in the [Q/Fe]-[Fe/H] diagram but at lower 
[Fe/H]. However. [Kobavashi fc Nomotol (f2009l ) argued that 
if this is due to the SNe la contribution, [Mn/Fe] should 
also show an increasing trend at the same [Fe/H], while 
[Mn/Fe] ratios in dwarf galaxies arc as low as in the Galactic 
halo stars ()Mc William , Ri ch fc Smec ker-Hanc 2003). They 
claimed that the low [a/Fe] and low [Mn/Fe] abundances 
patterns in the dwarfs are more consistent with the lack of 
massive star contribution. Or perhaps both the slower SFH 
and the lack of massive stars play a role in explaining the 
chemical profile of the dwarf galaxies. 

The slow SFH also coincides with the observed over- 
abundances of heavy s-process elements (/is) such as Ba 
and La relative to their Galactic counterparts at [Fe/H] 
> —1.5, since there is more time for the hs elements 
to be produced by the s-process . On the o ther hand, 
at low metallicit y [Fe/ H] < -2.5, lAoki et all (|2009l ) and 
iTafelmever et al.l l|2010h found that dwarf galaxies do not 



^ Some exc lude LMC from the dwarf galaxies category (e.g. 
lMateoll99S^ . However, for simplicity, we do not distinguish fainter 
satellite galaxies and dwarf galaxies in this paper 



show over-abundances of [/is/Fe] and tend to follow the 
Galactic halo trend. They argued that, at this metallic- 
ity, all of the n-capture elements are produced by the r- 
process. The light s-process elements (/s) such as Sr, Y and 
Zr do not however show over-abundances for dwarf galax- 
ies at [Fe/H] > —1.5, and are sometimes under-abundant, 
similar to [a/Fe]. This is not unexpected. Although the s- 
process in Galactic low-mass AGB stars produces both Is 
and hs elements (^Herwig 20051 ; iKappeler et al.ll201ll '). it has 
been proposed that the relative under-abundances of the Is 
elements could be ex plained by the metallicity-dependenc e 
of the s-process (Tols tov et"alll2009l ; [Kappeler et aLlbOllI ). 
Due to the primary nature (i.e. independent of metallicity) 
of the ^^C pocket as the major neutron source, theoretical 
studies suggest that metal-poor AGB stars will preferen- 
tially produce heavier s-process elements due to the high 
neutron-to-seed ratio at low metallicity (Gallino ct al. 199a; 
iBusso et al.ll200ll ; ICristallo et "ai]|2009l : [Bisterzo et al.ll20l'^ . 



3 DATA SELECTION 

Different mechanisms contribute to different metallicity in- 
tervals. For example, mechanisms that are associated with 
more massive progenitors will preferentially contribute at 
lower [Fe/H] due to the short evolution time, whereas the 
s-process in low-mass AGB stars will only contribute at 
higher [Fe/H] due to their longer evolution time. The C- 
space dimensionality and its interpretation may then be dif- 
ferent in different metallicity ranges. Therefore we separate 
our discussion according to two major metallicity intervals; 
-3.5 < [Fe/H] < -2 and -1 < [Fe/H] < 0. These correspond 
roughly to the low metallicity halo and the high metallicity 
(thick + thin) disks of the Galaxy. In some cases we will 
also study the intermediate interval of —2.5 < [Fe/H] < —1. 
These metallicity ranges are just a rough guide and depend 
on the samples that we adopt from the literature. 

There are many observational st udies of elemen- 
tal ab undances in the literature ( Edvardsson et ahl 

19931; iMcWilliam fc Richl 



Nissen fc Schustd 1 19971. 



_ _ _ - - - - - - - Hanson et al.l . 

Prochaska et all 120001; 'israelian et al.''200l'; 'Carretta et idj 



McWilhaml 



2002; Johnson fc.Bohc 2002a; Stephens fc Bocsgaard 200j; 


Chen ct all 12002. l2003l; 


Nissen et all 12002. 12004. I2007allbl; 


Gratton et al.l 2003al b 


; iBensbv. Feltzing fc Lundstrom 


2003j;lBensbv et al.l200^; 


Akerman et al.l2004;lArnone et al. 


2005; Jonsell et al.''2005; 


Caffau ct al."2005b'; Asplund et al. 


2005 
200a 


a, 2006; Garcia Perez ct al. 2006; AUcndc Pricto et al. 
; IPreston et al. 2006; Fulbright. McWilliam. fc Rich 


2007 


; iLai et al. 2007.. .2008; .Ruchti et al. 2010l; Fuhrmann 


2011 
the 


), but the analyzing models, solar abundances and 
sources of stellar parameters vary from author to 



author. This may cause systematic differences in elemental 
abundances, and produce spurious dimensions in the PGA 
analysis. Therefore, where possible we do not use compila- 
tions of abundances from multiple sources. Many surveys 
are relatively restricted in the number of elements measured 
or number of stars (< 50) observed. Some surveys are 
strongly biased to sp ecific classes of met al-poor stars, such 
as CEMP stars (e.g. I Cohen et al.ll2006l ), r-enhanced stars 
or related to plan et hunting studies (e.g. ISousa et al. I l2008l ; 
iNeves et al.ll2009l ) and therefore not suitable for our study. 
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Figure 1. This figure shows the metalli city dist r ibutio n of each sample that we have adopted. For solar neighbo urhood stars, 
we ha v e low meta l licity stars froni iBarklem et al.l | |2005| '| and F irst Stars Sur vey; intermediate metallicity stars from lBurris et a\\ 
||200(]| '). iFulbrightl bOOOl. l2002h: a nd high metallicity sta rs from |Reddv et all 12003. 2006). We also have the Fornax dSph galaxy 
sampl e from lLetarte'etaL I tout ), the LMC sa mple from | Pom peia e r'aLril2008l ) . MW globular cluster compilation from lPritzl et all 
||2005| ) and MW open ciiisier compilation from ICarrera fc Pancinol ( 120111" The dashed lines show the median abundance of each 
sample. 



because our goal is to measure the overall dimensionality of 
the C-space. For each metallicity range, we focus on a few 
relatively unbiased, large samples of homogeneous data, 
and adopted their abundances directly from the original 
papers. For the open and globular clusters, we had no 
alternative but to use compilations. 



3.1 Low metallicity 

For solar neighbourhood low metallicity ha lo stars in this 
study , we use the observational data from Barklem et al.l 
ll2005ll and th e First S tars Survey (ICavrel et al.l 12004 



Francois et al 



Barklem et al 



20091 ). From 
stars in the 



we consider only stars m 
3.5 < [Fe/H] < -1.5. We exclude CEMP stars and 



120071 : iBonifacio et aU 
20051 ) . we consider only 

range " " 

blue stragglers as disc ussed below. In total, after culling, 
we have 231 stars from iBarklem et al.l (|2005l ) and 50 stars 

from First Stars Survey^ 

Blue stragglers jPreston &: Sned"enl |2000| ) and some 



classes of CEMP stars are believed to have suffered binary 
mass transfer (cf. Section [2]) and therefore might not re- 
flect the abundances profile of the ISM from which they 
formed. We exclude known blue straggle rs and CEMP stars 
with [C/Fe] > 1: this criterion is from [Beers fc Christliebl 
(|2005j ). Having made this exclusion, the dimensionality 
of C-space derived by our PCA method should be re- 
garded as a lower limit, in case the carbon enhancement 
of so me CEMP stars is not due to binary mass transfer 
(e.g. lAoki et al.l |2002| : iDepagne etall 120021 '). but to some 
distinctive astrophysical origins such as faint supcrnovae 
(Tsujimoto fc Shigcvamal I2OO3I: lUmeda fc Nomoto 200"3 : 
lKarlsson|[2CI03 : lKobavashi. Tominaga fc Nomotdl2011bl ) or 
massive rotating stars ( Mevnet et al.l 20061 ) . 

As the metallicity decreases, departures from Local 
Thermodynamic Equilibrium (LTE) are expected to be - 
come more pronounced (for a review, see lAsplundl[2005bh . 
Therefore Non-LTE (NLTE) calcul a tion are neede d (e.g . 
iBaumiieller. Butler fc GehrenI Il998l : ICratton et akl Il999l : 
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Mashonkina & Gehren 200ll: Korn. Shi & GehrerJ 


2002 


Takeda et alj 12003: Andrievskv et al. 


20091. 201G|. 


2011 


Bereemann. Pickcrine & GehrerJ l2010a 


). However, note 



2010l: lAndreuzzi et all I2OIII : IJacobson. Friel fc Pilachowskil 



2011f ) are suitable objects. 



that iBar klc m ot al.. (,201lf ) cautioned that estimating the in- 
elastic hydrogen atom colhsions with Drawin's formula in 
these studies might not be appropriate. As a large homoge- 
neous NLTE-corrected sample is not yet available, and com- 
bining heterogeneous samples will give spurious dimensions 
in the PGA analysis due to the systematic differences, we 
decided to use available homogeneous ID-LTE abundances. 
All the chemical abundances from ID-LTE models are taken 
directly from the original papers, with the exception of the 
element Al. 

We include Al in our analysis because, unlike Na and O, 
Al abundances do not suffer from significan t internal mixing 
in G alactic stars l|Andrievskv et al] l2008l : I Bonifacio et al.l 
I2OO9I ). However, Al abundances are affected by NLTE effects 
at low metallicity. We a dopted a -1-0.6 NLTE correctio n for 
[Al/Fe] as suggeste d by iBaumiieller fc GehrerJ (liggTl ) and 
ICohen et alJ (|2004l ). Our results would not alter if we were 
to exclude Al. Al is the only NLTE-corrected abundance 
in this study. Our results should be reviewed once a large 
homogenized NLTE-corrected sample is available. 



3.2 Intermediate, high metallicity 

For solar neighbou rhood int ermediate metallicity stars, we 
chos e to use t he iBurris et al. ( 2000 ) sample (70 stars) 
and iFulbrightl l|200d. l2002i rsample ( 178 stars). F o r high 
met allicity disk stars, we use the iReddy et alj (|2003l ') 
and iReddv. Lambert fc AUende Prietol ( 20061 ) samples (357 
stars). These samples are adopted without further modifica- 
tion beyond culling obvious outliers (e.g. [Al/Fe] > 1, [V/Fe] 
> 0.8, [Co/Fe] > 0.7, [Nd/Fe] > 5, [Eu/Fe] > 5). 



3.3 Dwarf galaxies 

To compare the state of chemical evolution in the Milky 
Way and fainter satellite galaxies, we study the Fornax dSph 
galaxy using the lLetarte et all (|201(]| ) sample (80 stars). We 
exclude the star B058 because it is a metallicity outlier 
([Fe/H] = -2.58) for this sample (-1.2 < [Fe/H] < -0.6). 
We also studie d the LMC, us i ng th e homogeneous sample 
(57 stars) from lPompeia et alj (|2008h . For dwarf galaxies, to 
our knowledge, these are the only public available datasets 
with large (> 50 stars) homogeneous samples and elemental 
abundances measured at high resolution. 



No large homogeneous survey of globular clusters in- 
clu ding a wide variety o f elements is available to date (but 
see ICarretta et"ai] |2009| . for a homogeneous survey with a 
restricted number of elements). We ha ve to use the com- 
pilation of iPritzl. Venn fc IrwinI (120051 ) from different au- 
thors. In this compilation, mean abundances for each globu- 
lar cluster were derived. Since we do not study light-elements 
like C, N, O, Na that have been shown to have star-to- 
star dispersion within a cluster, other elements should have 
small star-to-star dispersion within a cluster and therefore 
it is justified to take mean abundances for most of the el- 
ements in this stud y . How ever, we are aware that recently 
iRoederer fc SnedenI (|201ll ) have shown star-to-star disper- 
sion in heavy n-capture elements (like La, Eu) and there- 
fore taking mean abundances of each globular cluster will 
only give a lower limit of the C-space dimensionality. We ex- 
clude objects identified with a dwarf spheroid al galaxy de- 
bris st ream, such as Rupl06, Pall2, Ter7 (e.g. iGaffau et alj 
l2005al ). and M68. Metallicity outliers are excluded by re- 
stricting [Fe/H] to the range —2.5 to —1. This leaves us 
with a total of 33 clusters. 

We also study a rece nt open clusters compilation from 
iGarrera fc Pancind (|201ll ) (private communication). We ex- 
clude those open clusters that have only [Fe/H] measure- 
ments. For clusters with multiple measurements, we take 
the mean abundance for each element. The sample then 
contains a total of 78 clusters, with Galactocentric radii 
6.4 =^ re < 20.8 kpc and -0.57 [Fe/H] < 0.41. For 
this compilation, we found that different model parameters 
lead to systematic differences of up to about 0.1 dex. Other 
sources of systematics such as differences in model atmo- 
spheres and methodology (e.g. equivalent width vs. spec- 
trum synthesis), are difficult to quantify but are likely to be 
smaller. Thus we estimate that the systematic differences 
can be up to 0.2 dex depending on the elements. 

Since we will not perform PGA combining multiple 
samples other than globular clusters and open clusters, 
and the solar abundance differences for elements heav- 
ier than Na are usually small (e.g. Anders fc Grevessd 
19891: iGrevesse fc Sauva lll998l.lAsplund. Grevesse fc SauvaJ 



2005cj : lAsplund et alj l2009l : iLodders. Palme fc Gaill bOOgl ). 

we do not homogenize the solar abundances adopted by dif- 
ferent authors. The metallicity distribution of each sample 
is shown in Fig. [T] and we summarize our adopted samples 
in Table □ 



3.4 Globular and open clusters 

The halo/disk star samples are mostly in the solar neigh- 



bourhood with 7.5 < 



ra 



< 8.5 kpc, where ra is 



the distance from the Galactic centre. We would like 
to know whether stars over a larger Galactic volume 
would give us more inhomogeneity and therefore more 
independent dimensions in the C-space. To pr obe wider 
regions , glob ular clusters ( Searle fc ZinnI 19781') . r noving 
groups (e.g. iDe Silva et all l2007l: iBubar fc Kin3 120101) 
and open clusters fe.g.jYong. Carney fc Teixera de Almeida 



I2OO5I : iFriel, Jacobson fc Pilachowskil l2oTor iPancino et al 



4 ANALYSIS METHOD 



4.1 PCA 



We start with the C-space defined by the set of element ratios 
[Xi/Fe]. We form the matrix of the correlation coefficients 
between all pairs of element ratios [Xi/Fe], [X^/Fe] and di- 
agonalize this matrix. One can show (Appendix|X]) that the 
eigenvector corresponding to the largest eigenvalue is the 
direction where we have the largest variance in the mean- 
shifted, normalized data set and so on for the successively 
smaller eigenvalues. Furthermore, the variances along those 
directions are given by the corresponding eigenvalues. Note 
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Table 1. Summary of adopted samples in this study. 



References 



Categories 



Count* 



MW Solar Neighbourhood 
Barkle m et al. (2005) 
First Stars Surveyt 
Reddv et al. (2003. 2006) 
Fulbright (2000. 2002) 
Burris et al. (2000) 
Others 

Pritzl et al. (2005) 
^arrer^^^ancino (201V) 
Letarte et al. (2010) 
Pompeia et al. (2008) 



Metal-poor halo stars 


231 


Metal-poor halo stars 


50 


Metal-rich disk stars 


357 


In between 


178 


In between 


70 


MW globular clusters 


33** 


MW open clusters 




Fornax dSph galaxy 


80 


The LMC 


57 



* After restricting mctallicity range; culling outliers, CEMP and 

bl ue stranglers 

t ICa.vrel et al.l l|2004 ): iBonifacio et al ] l|2003); I Francois et al.l 
1I2OO7I ) 

** Number of clusters 



that the matrix is symmetric, therefore the eigenvectors are 
orthogonal. These eigenvectors are the principal components 
and we can, to some extent, interpret these eigenvectors in 
terms of nucleosynthetic processes. We illustrate the proce- 
dure with toy models. 



4.1.1 Toy models 

Take a 3-dimensional space defined by three elements which 
we denote as Ell, E12 and E13, and assume that there are 
two element-producing mechanisms. The first produces ex- 
actly the same amount of Ell, E12 and E13 and the second 
mechanism only produces Ell and E12 with production ra- 
tio E11/E12 = 7. Figs. [2] and [3] illustrate the cases 7 = 1 
and 7 = 2. For each figure, panel (a) shows the scenario 
where only mechanism 1 is working: all elements are pro- 
duced in exactly the same ratio. Panel (b) shows the scatter 
plot with both mechanism 1 and mechanism 2 contributing 
to the element abundances: e.g. in panel (b), the points have 
now been randomly translated in the direction (1,1,0) by the 
action of the second mechanism. 

The first principal component of the PCA analysis, 
shown in panel (b) as the blue solid line, represents the di- 
rection that has largest variance in the normalized, mean 
shifted C-sp&ce. After determining the first component (with 
the largest eigenvalue), the PCA machinery then projects 
orthogonally all data points on to the hyperplane normal 
to the first principal component, as shown in panel (c) and 
panel (d). 

The second principal component is then the direction 
in the hyperplane that shows the largest variance among 
the projected points. Panel (d) shows the data points pro- 
jected on the hyperplane of the first principal component. 
The second principal component is shown as the red solid 
line. The composition of the normalized eigenvectors in the 
3-dimensional C-space are shown in the bar chart (e), with 
the first eigenvector as the three bars on the left and the 
second eigenvector as the three bars to the right. The first 
eigenvector has approximately equal components in each ele- 
ment and represents the contribution of mechanism 1. Mech- 



anism 1 has equal components, but the eigenvector shows 
small departures from equality which come from projections 
of random amounts of mechanism 2 on to this eigenvector. 

The first eigenvector has a clear interpretation: in the 
bar chart (panel e), the contributions from mechanism 1 
have similar components for all elements, as expected. The 
second principal component represents the contribution of 
mechanism 2. In the bar chart. Ell and E12 have the same 
sign but E13 has the opposite sign. It is important to note 
that, although the second mechanism does not produce E13, 
the contribution of E13 to the second principal component is 
not zero. This is intuitively clear, because the second com- 
ponent must be orthogonal to the first and must lie in the 
hyperplane shown in panel (c). The direction where we have 
both positive Ell and E12 in the tilted hyperplane is point- 
ing in the negative direction of E13. This illustrates an issue 
with interpreting the eigenvectors in terms of nucleosyn- 
thetic processes in more realistic systems. Although both 
principal components are well defined, the first eigenvector 
has a clear interpretation, but the interpretation of the sec- 
ond and later eigenvectors is less straightforward. 

Comparing the second eigenvector for 7=1 and 7 = 2, 
Ell and E12 show similar contributions for 7 = 1, but for 
7 = 2 the contribution from Ell is larger. This is not sur- 
prising; as we have seen in panel (d), the second eigenvector 
on the hyperplane shows a ratio of Ell to E12 > 1 when 
7 = 2. As before, although both principal components are 
well defined by the PCA, the first eigenvector has a clear in- 
terpretation but it is less obvious for the second eigenvector. 
When one component is in the positive diagonal direction 
(i.e. all elements contribute with the same sign) and the sec- 
ond eigenvector components show opposite signs of different 
families (here Ell & E12 vs E13), this suggests that there 
is contribution from a first mechanism that produces both 
families, and contribution from a second mechanism that 
preferentially produces Ell & E12 or preferentially produces 
E13 but we cannot tell which without further diagnostics. 
Section [5. II gives an example of a situation in which further 
diagnostics can illuminate the interpretation of second and 
later eigenvectors. 



4.1.2 Dealing with incomplete data sets 

Given an incomplete data set (i.e. some of the element abun- 
dances are not available for all stars), in principle we can still 
calculate Pearson's correlation for any two chemical abun- 
dances by using only the data points that have both chemical 
abundances, and therefore we can construct the correlation 
matrix entry by entry. However in this case the correlation 
matrix is clearly not a Gram matrijfl and therefore might 
not necessarily be semi-positive-definite and we might have 
unphysical negative eigenvalues. 

This is a well-known pr oblem in finance ana l ysis. W e use 
the algorithm suggested bv lRebonato fc Jackej (|l999h . The 
idea is to search for a closest semi-positive-definite matrix 
C that resembles the correlation matrix (cf. Appendix [BlT 



^ A n X ?i matrix is Gramian if and only if for every i,j £ 
{1,2, ...,n}, the (i,j)-th entry is given by the inner product 
{vi , Vj ) of the same set of vectors vx,V2, . . . , Vn 
* Another method to deal with missing data is by guessing the 
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(c) 



(a) 




0.0 El 3 



(b) 




(d) 




(e) 



■ Component 1, Eigenvalues 

■ Component 2, Elqenvolues 


= 2.85, Cumul. % = 94.99 
= 0.15, Cumul. % = 100.0 


III 






"l 



Ell EI2 EI3 Ell EI2 EI3 



Figure 2. This is a model (7 = 1) to illustrate how PCA works. The vertical lines in panel (a) and (b) are visual guides. Panel 
(a) shows the scatter plot of the chemical abundances where only mechanism 1 is working (see text). Panel (b) is the same as (a) 
but both mechanisms 1 & 2 are contributing to the chemical abundances. Panel (b) also shows the first principal component in blue 
solid line and the second principal component in red solid line. Panel (c) shows the the hyperplane of the first principal component in 
grey and the second principal component in red solid line. Panel (d) shows the data points projected onto the hyperplane of the first 
principal component and the red solid line is the second principal component. Panel (e) shows the composition of the first two normalized 
eigenvectors. Colour version of all figures in this paper are available in the online version of the journal. 
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We can measure the differences between these two matrices 
using the canonical matrix norm or the quadratic sum of the 
differences of the eigenvalues (cf. Appendix |B]). Throughout 
our study, the differences of either norm over the number of 
entries gave value <^ 1 and therefore the results are robust. 

After orthogonally diagonalizing the matrix C, we rank 
the eigenvalues in decreasing order and calculate their cumu- 
lative percentages. The cumulative percentages of ranked- 
eigenvalues represents the accounted total variances of the 
data cloud. 

4-1.3 Best cut-off for ranked-eigenvalues cumulative 
percentages 

Given that the measurements are not perfect, what percent- 
age of the variance of the data cloud is due to measurement 
uncertainty? For example, Section l5.ll shows the cumulative 
percentages for the ranked-eigenvalues for the n-capture el- 
ements at low metallicity abundance. The first component 
provides about 80% of the variance. The next two provide 
almost all of the rest of the variance. Our goal is to find 
the dimensionality of the C-space. How many of the prin- 
cipal components should we accept as real? We attempted 
to answer this question by performing Monte Carlo simu- 
lations, in which we created mock data sets lying in an n- 
dimensional spaces and tracing a non-tilted m-dimensional 
flat manifolqj. The mock data set has a spread of A = 0.3 to 
0.7 dex mimicking the real situation with element abundance 
distributions. We performed the simulation by rotating the 
manifold with a random orthogonal matrix. For each sim- 
ulation, we added 0.1 dex of measurement uncertainty and 
then performed PCA on the noisy data. We recorded the 
cumulative percentages corresponding to the m-th principal 
component. In this way, we tried to estimate the cumulative 
percentage at which we should stop accepting principal com- 
ponents as real, in order to deduce the correct m-dimensions 
of the manifold. 

For each set of values of n and m, we performed 10000 
simulations. We varied n, m and A in the range n £ [7, 15], 

missing data from the best fit model, where the best fit model 
is deduced from the rest of the data. However in this case, un- 
avoidably, we will strengthen our correlation unnecessarily. This 
illustrates a problem with most of the manifold learning methods 
such as Locally Linear Embedding (LLE), ISOMAP, Diffusion 
Map etc. These methods require a complete data set. Therefore 
there is no alternative to guessing the missing data. We argue that 
the method that we introduce here is a more democratic way to 
deal with missing data without imposing any prior. 
^ A m-dimensional flat manifold in n-dimensional space is the 
generalization of a 2-dimensional plane in 3-dimensional space. In 
this study, we made two assumptions: (1) If correlated, [Xi/Fe] 
are correlated linearly; (2) The errors of [Xi/Fe] are Gaussian- 
distributed. The first assumption is reasonable because [Xi/Fe] 
are abundances in log scale- the assumption holds if Nx- oc 
NV} , for all m E M, where Nx is the abundance of element 
X. Furthermore, our study samples are always restricted to a 
small metallicity abundance range. To justify the second assump- 
tion, w e compared the [Fe/H ] of stars in common of RAVE 
Survey jSteinmetz eta l.ll2006l ) and Geneva-Copenhagen Survey 
jNordstrom et al.l bOolT We found that the differences can be 
approximated by a Gaussian distribution. This suggests that the 
abundance errors in log scale could be Gaussian-distributed. 



m £ [2,5], A G [0.3,0.7]. The simulation showed that a 
reasonable cutoff for identifying real principal components 
in the PCA analysis is about 85%. 

In summary, assuming a cosmic spread of A — 0.3- 
0.7, measurement uncertainty of 0.1 dex and a flat manifold 
(which can be tilted), the rank of the eigenvalue correspond- 
ing to the cumulative percentage of about 85% gives a ro- 
bust estimate of the independent dimensions of the chemical 
space. The remaining variance of the data cloud comes from 
measurement uncertainty. If the measurement uncertainty is 
less than 0.1 dex, we should take the cut off larger than 85% 
and vice versa. In this study we will perform PCA analysis 
on the random variables [X/Fej3, where X can include Al, 
Sc (light odd-Z elements); Mg, Si, Ca, Ti (a-elements), V, 
Mn, Cr, Co, Ni, Zn (Fe-peak elements) and Y, Zr, Ba, La, 
Nd, Eu (n-capture elements). Among n-capture elements, we 
consider Y and Zr as light s-process (Is) elements, Ba and 
La as heavy s-process (hs ) elements and Nd, Eu as mostly r- 
proce ss elements (refer to lArlandini et al ] |1999I :Ib urris et al.l 
This choice of elements is determined by those in the 
available abundance surveys (see Table 1). The HERMES 
survey will include some other elements, and their contribu- 
tion is discussed later. 

It is important to note that the assumption A — 0.3-0.7 
might not hold for some elements, such as Ni. Our simula- 
tions show that if the intrinsic cosmic scatter for all [X/Fe] 
is 0.3 dex or less, the simulated noise will dominate over 
the intrinsic cosmic scatter and the the estimated cut-off 
will start to drop significantly (about 70%). However, this 
will not qualitatively alter our conclusion; for n-capture el- 
ements subspace, the cosmic scatters of n-capture elements 
are believed to be larger than 0.3 dex. For the all elements 
space, we only interpreted the first four eigenvectors, for 
which the cumulative eigenvalues percentage is about 75%, 
cf. Section O 



4.2 Estimate of intrinsic correlation 

For typical samples of stellar abundances with typical mea- 
suring errors, we showed in Section 14.1.31 that we can take 
as real the eigenvectors or principal components contribut- 
ing to the first 85% or so of the cumulative percentages for 
the ranked eigenvalues. Now we address a different issue. 
We may need to compare two samples which have different 
measurement uncertainties. For example, when comparing 
the principal components for abundances in dwarf galaxies 
and the Milky Way, it is important to find a way to correct 
for the different measuring uncertainties. 

We can do this by estimating via simulations the intrin- 
sic correlations for each sample, i.e. the values of the correla- 
tions if the measurement were perfect, without uncertainties. 
Then we can directly compare the principal components for 
each sample. Having reduced the contribution of noise to the 
correlations, the level of the cumulative percentages for the 
real ranked eigenvalues is now larger than 85%, and close 



We chose to work in [X/Fe] space because all elements are 
highly correlated in [X/H], i.e. if we were to perform PCA on 
[X/H], the dominant dimension will consume more than 80% of 
the variance for all cases. 
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to 100%, though not exactly 100% because of the residual 
uncertainties in the noise reduction process. 

In the simulations, given two correlated elements Xi 
and X2, first we need to find the best fit line to [A'l/Fe] 
and [A'2/Fe] to create the mock data set. Instead of deriving 
the best fit line using weighted or bi-weighted linear least 
squares, we searched for the best fit line by weighted total 
least squares. For more details, refer to iKrvstek fc AntorJ 
l|2007ll . and Appendix O 

This is crucial to make sure that the best fit line is 
symmetric in fitting y to x and fitting x to y and therefore 
minimize the differences between estimating in the forward 
and reverse directions. Let {xk,yk) be the k-th data point, 
and Ux,k and Uy^k be the corresponding measurement uncer- 
tainties. In the special case = Uy^k ~ o", minimizing 

2_ 1 ^\{xk~Xkf , ivk^Ykf] 



is equivalent to minimizing the weighted orthogonal distance 
of the measured points (xk, yt) to the fitting line y — ax + b 
where {Xk, Yk) is the orthogonal projection of the data point 
{xk,yk) to the line. 

In the simulation, we start with perfectly correlated 
mock data points lying on the best fit line, as shown in 
Fig. [4] the red open circles. The black filled circles are the 
observed data. We then add simulated cosmic scatter on the 
red open circles. The updated mock data points are now 
denoted as blue open circles. We then add simulated mea- 
surement uncertainty (known amplitude) to the blue open 
circles and the updated mock data points are denoted with 
orange crosses. The mock data are adjusted until the mock 
data with simulated noise (cosmic scatter and measurement 
uncertainty) have the same scatter as the observed data, i.e. 
the program is iterated until the orange crosses have the 
same correlation as the black filled circles. The average of 
Pearson's correlation of the blue open circles from forward 
and reverse fitting is used as the estimated intrinsic correla- 
tion. For every element pair Xi and X2, we performed 100 
Monte Carlo simulations and adopted the median as the best 
estimate. This procedure works well for element pairs with 
measured correlations > 0.1 for which the correlations are 
probably real. Some element pairs have smaller correlations, 
and the procedure is not so obvious. 

The goal of much of our analysis is to compare the eigen- 
vectors and dimensionality for various domains of [Fe/H] and 
subsamples like the dwarf galaxies and star clusters. When 
the correlation between an element pair is small, its sign may 
not be correct because of noise. The Monte Carlo procedure 
described above will not change the sign of the correlation so 
cannot account for an error in sign of a small correlation. We 
therefore also explored the possibility that the true sign of 
the correlation is different from the one observed and then 
estimated what effect this has on the dimensionality; it is 
always very small. For the results reported below, we were 
guided by the larger stellar samples in adopting the sign of 
a small correlation for the Monte Carlo modeling. 



5 ANALYSIS RESULTS 

In this section, we show the PCA analysis results of n- 
capture elements alone and then all elements, including light 




-0.4 -0.2 0.0 0.2 0.4 

[Ba/Fe] 



Figure 4. A example of simulated mock data to calculate the in- 
trinsic correlation between [Y/Fe] and [Ba/Fc] for the high metal- 
licity sample, in metallicity range —1.2 < [Fc/H] < —0.6. Here we 
assume a measurement uncertainty of 0.05 dex for both [Y/Fe] 
and [Ba/Fe]. The symbols are explained in the text. The black 
solid line is the best fit line calculated using weighted total least 
square. The black dashed vertical lines are the la of the observed 
data points distribution, assuming Gaussian distribution. 



odd-Z, a-, Fe-peak and n-capture elements. We discuss stars 
of low metallicity and high metallicity separately, and then 
consider open clusters, dwarf galaxies and globular clusters. 

5.1 Low metallicity stars 

We first study the dimensionality of the low metallicity stars 
using the ranked-eigenvalues cumulative percentages. Fig. [5] 
shows the results of the Barklem's sample (solid line) and 
First Stars Survey sample (dotted line) considering firstly 
the n-capture elements (Y, Zr, Ba, Nd, and Eu) alone. The 
vertical dashed line shows the principal component dimen- 
sions that corresponds to 85% of the cumulative percentage. 
Both samples showed that the n-capture elements at low 
metallicity have a strong component that makes up almost 
all of the variances. 

We then examine the individual components. Fig. [6] 
shows the composition of the first two eigenvectors (first 
two principal components), i.e. the first two directions that 
are orthogonal and account for most of the variance in the 
normalized mean shifted C-space. The upper plot and lower 
plot show the results of the Barklem sample and First Star 
Survey sample respectively. The colour is coded as a rough 
grouping to differentiate families of elements from differ- 
ent major production sources: dark red represents light s- 
process elements, red stands for heavy s-process elements, 
and mostly r-process elements are coded as orange. For sim- 
plicity, on the X-axis we denote [X/Fe] as X in all such fig- 
ures in this paper. Since the eigenvectors are normalized, the 
quadratic sum of the five [X/Fe] abundances contribution for 
each principal component equals 1. 

In the first component, we see that all elements have the 
same sign. This illustrates that this dominant principal com- 
ponent is pointing in the positive diagonal direction in this 
5-dimensional C-subspace. We can infer that the first com- 
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Figure 5. This ranked-eigcnvalucs cumulative percentages of n- 
capture elements (Y, Zr , Ba, N d, and Eu) for the low metallicity 
samples— iBarklem et al] | |2005| ) sample (solid line) and First Stars 
Survey sample (dotted line). 



ponent represents a mechanism that produce all 5 elements 
simultaneously. This suggests that all of these n-capture el- 
ements are mostly produced by a dominant pr imary process 
at low redshift, which could be the r-process ([Truran et al.l 
l2002l : IWanaio fc Ishimarull2006l ). 



The second eigenvector points in the anti-diagonal di- 
rection of the light s-process elements (Y, Zr) and the heav- 
ier n-capture elements (Ba, Nd, Eu), i.e. this component 
represents a mechanism that produces mainly light s-process 
elements or mainly heavy n-capture elements. Fig. [7] helps 
to resolve the nature of this mechanism: we see that the 
light s-process elements are over-abundant at low metallic- 
ity, and therefore the former interpretation is preferred. This 
second principal component might correspond to the contri- 
bution of LEPP (iTravagho et al.ll2004l : iQian fc Wasserburd 
l2007l : llzutani et aLlbOOa T However, as shown in the ranked- 
eigenvalues cumulative percentages plot (cf. Fig.[5ll, the con- 
tribution of this component is small relative to the first prin- 
cipal component. We discuss this further in Section |S] 

To show the relative importance of the two components 
as a function of metallicity, we performed the PCA analysis 
on stars in [Fe/H] bins of 0.7 dex. We calculated the ratio of 
the second component eigenvalue to the sum of the eigenval- 
ues for the first two components to estimate the contribution 
of the second component to the overall n-capture element 
yield. The results is shown in the upper panel of Table (2] 
The same calculation was repeated for the First Stars Survey 
sample and is shown in lower panel of Tabled The fractional 
contribution of the second component appears to decrease 
with increasing metallicity. We have performed Kendall's r 
and Spearman's p tests: both tests showed that this trend is 
significant (< 5% probability of a false positive correlation). 

Note that La is not included in the analysis of n-capture 
elements, in order to compare the low metallicity results 
with those for high metallicity. La is not measured in our 
high metallicity Reddy's sample. However, we have tested 
our results by adding La or restricting ourselves to a smaller 
subspace. In either case, it does not alter the results quali- 



1.0 



0.5 



0.0 



-0.5 



1.0 



0.5 



0.0 



-0.5 



Component 1, Eigenvalues 
Component 2, Eigenvalues 



illll 



4.33, Cumul. % 
0.53, Cumul. % 



86.59 
97.15 



Zr Ba Nd Eu 




Zr Bo Nd Eu 



Component 1, Eigenvalues - 3.97, Cumul. % — 79.43 
Component 2, Eigenvalues - 0.63, Cumul. % = 92.03 



mil 



■II 



Y Zr Ba Nd Eu 



Y Zr Bo Nd Eu 



Figure 6. The composition of the first two n-capture elements 
normalized principal components, i.e. the eigenvector compo- 
nents of the n-capture elements correlation matrix, using the 
low metallicity samples: Barklem's sample (upper plot) and 
First Stars Survey sample (lower plot). The horizontal axis 
shows the 5 n-capture elements and the vertical axis shows 
the component of the eigenvector in the direction of each of 
the 5 elements.. See the text for the colour code. 



Table 2. Fraction of the second component contribution to the 
n-capture elements yield: Barklem's sample (upper panel), the 
First Stars Survey sample (lower panel). 



[Fe/H] 


range 


Fraction(%) 


(-3.4, 


-2.7) 


10.15 


(-3.3, 


-2.6) 


10.09 


(-3.2, 


-2.5) 


8.41 


(-3.1, 


-2.4) 


7.88 


(-3.0, 


-2.3) 


7.09 


(-2.9, 


-2.2) 


5.62 




[Fe/H] 


range 


Fraction(%) 


(-3.5, 


-2.8) 


18.10 


(-3.4, 


-2.7) 


16.78 


(-3.3, 


-2.6) 


13.67 


(-3.2, 


-2.5) 


10.96 
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Figure 7. Y (is) to Ba (hs) ratios as a function of [Fc/H] for the 
Milky Way hal o /disk stars from First Stars Survey {red crosses), 
iBarklem et~all I2OO5I) (blue diam o nds) , Fulbriglit (2000, 
(black filled circles) , \ReddY et alj ^2003, 2006) 
and globular clusters from lPritzl et al. (2005) (black open circles) 
All samples assume ID-LTE stellar model. 



2002) 
(cyan triangles) 



tatively. The rest of the principal components are probably 
due to measurement uncertainty since the first two compo- 
nents already account for > 90% of the data cloud variance. 

Now we calculate the principal components for the 
whole set of elements available for the low metallicity stars. 
The principal components of PC A analysis on 17 elements 
(Al, Sc, Mg, Ca, Ti, V, Cr, Mn, Co, Ni, Zn, Y, Zr, Ba, 
La, Nd, and Eu) for Barklem's sample are shown in Fig. [S] 
The colour is coded to differentiate categories of elements: 
the n-capture elements are coded as before, hght odd-Z el- 
ements are colour-coded with dark brown, a-elements with 
blue, Fe-peak elements (beside Cr & Mn) with green, and Cr 
& Mn with black. If we adopt a 85% cutoff for the ranked- 
eigenvalues cumulative percentages, as shown in Fig. |9l we 
find about 6 independent dimensions for the 17-dimensional 
C-space. The number of dimensions is not altered if we apply 
a -1-0.6 dex NLTE correction for [Al/Fe], as shown in Fig. |9l 

The first four principal components of this 17- 
dimensional C-space can be summarized as follows: 

• The first component shows contributions primarily 
from all n-capture elements and also contribution from a- 
elements. This suggests that the production sites of the 
r-process also produce significant amounts of a-elements, 
which may be consistent with core-collapse supernovae as 
the r-process site rather than neutron star mergers. The first 
component could be the contribution from core-collapse su- 
pernovae that involve the r-process. 

• The second component shows an anticorrelation of a- 
elements with Fe-peak elements and n-capture elements. 
This suggests that there is a mechanism that produces a- 
elements but does not produce Fe-peak and n-capture ele- 
ments. This component may correspond to the contribution 
from 'normal' core-collapse supernovae that do not involve 
the r-process 

• The third component shows an anticorrelation of q- 
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Figure 8. The normalized principal components of 17 ele - 
ments for the low metallicity sample jBarklem et al.ll2005h . 
[Al/Fe] is corrected for NLTE effect with -1-0.6 dex. The upper 
plot and lower plot show the first two principal components 
and the third and fourth principal components, respectively. 
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Figure 9. The ranked-eigenvalues cumulati ve percentages of 17 
elements for the low metallicity sample tBarkle m et al .I2OO5I) . The 
solid line and dotted line show the results with and without the 
NLTE correction of [Al/Fe], respectively. 
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Figure 10. The same as Fig. E] but without the NLTE cor- 
rection for [Al/Fe]. 



elements and Fe-peak elements with the n-capture elements. 
This suggests the production of both a-elements and Fe-peak 
elements. This compon ent may correspond to the contribu- 
tion of hypernovae (e.g. lKobayashi et al.ll200^ ) where signifi- 
cant amounts of Co, Ni, Zn are produced with high explosion 
energy. 

• The fourth component shows a strong contribution from 
Cr and Mn. Different from heavier Fe peak elements, Cr 
and Mn are synthesized in the incomplete Si-burning region, 
which is the outer region of the complete Si-burning region. 
We discuss this further in Section [S] 

Note that with the NLTE correction, Al does not con- 
tribute to the first four components. If we do not apply 
NLTE correction for Al, Al contributes to the second compo- 
nent as shown in Fig. 1101 but the results for other elements 
remain unaltered. We will discuss this further in Section [G] 



5.2 High metallicity stars 

Fig. [11] and Fig. [12] show the PCA analysis results for the 
n-capture elements (Y, Zr, Ba, Nd, Eu) in the Reddy high 
metallicity sample. This results show two marked differences 
from the low metallicity sample (cf. Fig. [5] Fig. |6]|: (1) The 
first principal component contributes far less (56%) to the 
overall variance compared with low metallicity sample (79 
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Figure 11. The composition of the first three normalized princi- 
pal components for the n-capture elements for the high metaUicity 
sample. 
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Figure 12. The ranked-eigenvalues cumulative percentages for 
the n-capture elements for the high-metaUicity sample. 



- 87%). It is clear that at high metallicity, there is more 
than one mechanism contributing to the n-capture elements; 
(2) At high metallicity, the second component is made up 
of both light s-process elements (Y and Zr) and heavy s- 
process elements (Ba), instead of light s-process elements 
alone as in the low metallicity sample anticorrelating with 
mostly r-process elements (Nd and Eu). This component 
may correspond to the s-process in low-mass AGB stars. 
The s-process in low-mass AGB stars, operating in metal- 
rich environment, is expected to produce both light s-process 
and heavy s-process elements. The third component might 
be the remnant of LEPP contribution at low metallicity, or 
it might just be noise, since the first two components already 
account for 88% of the total variance. 

Fig. [13] shows the first four principal components from 
the PCA analysis of all 17 elements (Al, Sc, Mg, Si, Ca, Ti, 
V, Cr, Mn, Co, Ni, Zn, Y, Zr, Ba, Nd, and Eu) which are 
available for the high metallicity sample. The colour coding 
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Figure 13. The normalized principal components of 17 ele- 
ments for the high metallicity sample. The upper and lower 
plot show the first two principal components and the third 
and fourth principal components, respectively. 



is the same as in Fig. [S] The results are summarized as 
follows: 

• The first component is a strong component, accounts 
for about 40% of the total variance, and is made up of 
a-elements and Fe-peak elements (except for Mn and Cr) 
as well as light odd-Z element Al (ID-LTE-based). Note 
that NLTE correction is supposed to be negligible at high 
metallicity. We recall that in the low metallicity sample, Al 
was absent from the first few principal components when 
NLTE-corrected. This component may correspond to the 
well-mixed ISM that is affected by many generations of core- 
collapse supernovae. The deficit of Mn suggests that this 
component does not include the contribution from SNe la. 

• The second component is made up mainly of all of the 
n-capture elements, which could be the contribution from 
r-process. However, unlike the first principal component of 
the low metallicity sample which also includes all of the n- 
capture elements, the a-elements contribution are not visible 
in this second component for the high metallicity sample. We 
discuss this in more detail in Section [H] 

• The third component shows anticorrelation between 
light s-process, heavy s-process, a-elements and the other 
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Figure 14. The ranked-eigenvalues cumulative percentages of 17 
elements for the high metallicity sample. 



elements. This could correspond to the contribution of AGB 
stars ejecta. 

• The fourth component is again dominated by Cr and 
Mn. We will discuss this further in Section |6l 

Adopting a 85% cutoff for the ranked-eigenvalues cu- 
mulative percentages, as shown in Fig. 1141 we again have 
6 independent dimensions in the 17 dimensional C-space, 
as for the low-metallicity sample. This is not totally unex- 
pected. Although dimensions are l ost by the homogeniza- 
tion of ISM at higher metallicity IjKarlsson fc GustafssonI 
I2OOII : [Bland-Hawthorn et al.ll2010l ). there are also processes 
such as the AGB stars and SNe la that will contribute only 
at higher metallicity. This extra 'birth-imprint' appears to 
compensate for the loss of dimensions due to homogeniza- 
tion. 



5.3 Open clusters 

Now we turn to the open clusters, which cover a larger vol- 
ume of the Galaxy than the stellar samples discussed so far. 
We do not make a separate study of their n-capture element 
subspace because there are no Eu data in the open cluster 
compilation. Eu is potentially important to separate the r- 
process and s-process contributions. Instead, we analyze the 
subspace of a-elements: Mg, Si, Ca, Ti. We then analyze the 
all-elements space, including the 13 elements: Al, Sc, Mg, Si, 
Ca, Ti, V, Cr, Co, Ni, Y, Ba, and Nd. We compare the re- 
sults for the open cluster compilation with the Reddy solar 
neighbourhood sample by restricting the metallicity range of 
the solar neighbourhood sample to [Fe/H] > —0.5 to match 
the range for the open clusters. 

The ranked eigenvalues shown as the solid lines in 
Fig. [15] and Fig. [16] suggests that the C-space for the open 
clusters has a higher dimensionality (about 1 more dimen- 
sion) than for the nearby metal-rich field stars, both for the 
restricted a-element subspace and for the 13-element space. 
However, systematic differences between authors may con- 
tribute extra scatter to the open clusters abundances and 
the extra dimension could be spurious. To show that this 
is not the case, we have performed the intrinsic correlation 
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Figure 15. The ranked-eigenvalues cumulative percentages of 
the o-elements, comparing open clusters (black lines) with solar 
neighbourhood stars {red lines). The solid lines show the observed 
correlation. The dashed/broken lines show the cumulative per- 
centages derived from the estimated intrinsic correlation matrix, 
assuming measurement uncertainty of 0.05, 0.07, 0.10, 0.15, and 
0.20 dex respectively for open clusters, and 0.02, 0.03, and 0.10 
dex respectively for solar neighbourhood stars. 

estimates as described in Section 14.21 The results are over- 
plotted in Fig. [15] and Fig. [I6l 

For example in Fig. 1151 the black dashed/broken lines 
show the ranked-eigenvalues cumulative percentages derived 
from the estimated intrinsic correlation for the open clusters 
sample, assuming that the measurement uncertainty is 0.05 
(short-dashed), 0.07, 0.1, 0.15, and 0.2 (long-dashed) dex, 
respectively. The red dashed/broken lines show the cumula- 
tive percentages derived from the estimated intrinsic corre- 
lation for the solar neighbourhood sample, assuming a mea- 
surement uncertainty of 0.02, 0.03 and 0.1 dex, respectively. 
Recall that for the estimated intrinsic correlation matrix, 
by definition, there is no residual contribution from mea- 
surement uncertainty. Therefore the cumulative percentages 
cutoff should be close to 100% instead of 85%. 

The results show that our previous interpretation holds 
even if we make extreme assumptions about the measure- 
ment uncertainty of the two samples, such as assuming mea- 
surement uncertainty and systematic differences of 0.2 dex 
for open clusters and a very small measuring uncertainty 
of 0.03 dex for the solar neighbourhood stellar sample. The 
result is robust for the 4-dimensional a-element subspace. 
This is consistent with results in the literature that the a- 
element distribution may not be the same within and out- 
side the solar neighborhood: for example the enhanced ra- 
tios of [Mg/Fe] and [Ti/Fe], bu t not for [Si/Fe] and [Ca/Fe] 
reported by lYong et al.l |200^ at larger galactocentric dis- 
tances. For the larger 13-dimensional C-space, the result 
seems secure but is a little less robust because of the con- 
cerns described in Section 14.21 about deriving the intrinsic 
correlations for weakly correlated element pairs. 

Our results are c onsist ent with 

iDe Silva. Freeman fc Bland-HawthornI (|2009l ') and lRoedererl 
1 20091 ) that the larger regions of the Galaxy covered by 
the open cluster sample may show more independent 
variations of element abundances than the immediate solar 
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Figure 16. The same as Fig. I15l but for 13 elements, and assum- 
ing measurement uncertainty of 0.05, 0.07, and 0.20 dex respec- 
tively for open clusters, and 0.03, 0.05, and 0.10 dex respectively 
for solar neighbourhood stars. 

neighbourhood, even though we cannot readily identify the 
nature of the extra dimension of their C-space. Although 
the solar neighbourhood appears chemically well mixed, as 
shown by the relative tightness of say the [a/Fe]-[Fe/H] 
relation, this may not prevail over larger volumes of the 
Galaxy. 

5.4 Satellite galaxies 

Now we investigate two samples of stars, one for the Fornax 
dSph galaxy and one for the LMC. These are of interest 
because they represent different evolutionary environments 
from the Milky Way, and also because older versions of these 
galaxies may be si milar to the systems which merged with 
the Milky Way (see lSearle fc Zinnll 19781 ). 

In the previous section, we showed that the C-space di- 
mensionality might vary as a function of the survey vol- 
ume. Therefore, robust comparisons can only be made if 
the survey volumes are about the same. Since our homo- 
geneous satellite galaxy samples are restricted to a small 
survey volume (1-2 kpc in diameteiF'T) . it seems appropri- 
ate to compare the C-space results for the satellite galaxies 
and the solar neighbourhood, for which the surveys cover a 
similar volume. For each comparison, we restricted the solar 
neighbourhood stars to the same metallicity range as For- 
nax (-1.2 < [Fe/H] < -0.6) and the LMC (-1.0 < [Fc/H] 
< -0.5). 

First we did a PCA analysis on the n-capture elements 
(Y, Ba, Nd, and Eu). The results is shown in Fig. 1171 Fornax 
appears to show more C-space dimensions than the Milky 
Way's solar neighbourhood from the observed correlation 
(solid lines). However, the extra dimension could be spuri- 
ous if the measurement uncertainties for the dSph galaxy 
are larger. To examine this possibility, we estimated the in- 

Letar te's study covere d 25' of the cent re of Fornax (about 1 
kpc, e.g. iRizzi et al.ll2007l '). The survey of iPompeia et al.l ||2008| 1 

covered 2 kpc of the LMC inner disk 
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Figure 17. The n-capture element ranked-eigenvalues cumula- 
tive percentages comparing the Fornax dSph galaxy {black lines) 
with its solar neighbourhood counterparts {red lines) in the same 
metallicity range. The solid lines show the observed correlation. 
The dashed/broken lines show the cumulative percentages de- 
rived from the estimated intrinsic correlation matrix, assuming 
measurement uncertainty of 0.1, 0.13, 0.15, and 0.2 dex respec- 
tively for the dSph, and 0.07 dex for the solar neighbourhood 
counterparts. 



- Component 1, Eigenvalues = 2.60, Cumul. % = 51.95 
1.0- Component 2, Eigenvalues = 0.81, Cumul. % = 68.12 
■ Component 3, Eigenvalues = 0.76, Cumul. % = 83.32 




Y Ba Lo Nd Eu Y Bo La Nd Eu Y Bo La Nd Eu 

Figure 18. The composition of the first three normalized prin- 
cipal components for the n-capture elements of the Fornax dSph 
galaxy. 

trinsic correlation as before and the results is over-plotted 
in Fig. [T71 

Note that iReddv et all (|2003l . \200dl ) reported measure- 
ment uncertaint y of 0.07 - 0. 1 dex for n-capture element 
abundances and iLetarte et al.l ([201(1) reported uncertainty 
of 0.15 - 0.25 dex. In the comparison with Fornax, we took 
the measurement uncertainty a for the solar neighbourhood 
sample to be 0.07 dex; if we were to choose cr = 0.1 dex, the 
PCA results for the solar neighbourhood sample in Fig. [T7] 
would further deviate from the Fornax sample. 

The simulation shows that, if we assume that the mea- 
surement uncertainty for Fornax to be 0.1 - 0.15 dex, then 
the Fornax C-space has more independent dimensions than 
the solar neighbourhood in the same metallicity range. How- 
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Figure 19. The ranked-eigenvalues cumulative percentages for 
the a-elements for the dwarf galaxies {black lines) and the solar 
neighbourhood {red lines) in the same metallicity range. The re- 
sults for Fornax and the LMC are shown in the upper plot and 
lower plot, respectively. The solid lines show the observed corre- 
lation. The dashed/broken lines show the cumulative percentages 
derived from the estimated intrinsic correlation matrix, assum- 
ing measurement uncertainty of 0.1, 0.15 dex respectively for the 
dwarf galaxies and 0.05, 0.1 dex respectively for the solar neigh- 
bourhood counterparts. 

ever, we cannot exclude the possibility that the extra dimen- 
sion is due to larger uncertainty for the Fornax sample. If 
we assume that the measurement uncertainty of the Fornax 
sample is larger than 0.2 dex, then Fornax's dimensionality 

is the about the same as the solar neighbourh ood. 

On the other hand, there is evidence that lLetarte et all 

(|201(]|) may have overestimated their measurement uncer- 
tainty. For example, in their fig. 15, the [Eu/Fe]-[Fe/H] and 
[Nd/Fe]-[Fe/H] relations are quite tight, and the scatter of 
the points appears at odds with the error bar shown. Thus, 
we contend that the extra apparent dimension of the C-space 
for Fornax is probably not due to a larger measurement un- 
certainty. 

The principal components of the PCA analysis for the 
Fornax n-capture elements (Y, Ba, La, Nd, and Eu) are 
shown in Fig. 1181 The first component consists of all ele- 
ments in the same sign as before. This component represents 
a mechanism that produces all n-capture elements simulta- 
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Figure 20. The same as Fig. llQI but for all elements in common. 



neously which probably involves the r-process. The second 
component shows an enhancement in light s-process ele- 
ments (Y in this case) which could be the remnant of LEPP. 
And the third component shows enhancement in heavy s- 
process elements (Ba and La), which may be the contribu- 
tion of the s-process in AGB stars operating at metal-poor 
environment. We postpone further discussion to S ection [6] 
For the LMC sample, iPompeia et all l|2008l ) did not 
measure Eu. As Eu is a crucial element to distinguish be- 
tween pure r-process and s-process, we did not study the 
C-subspace of the LMC n-capture elements. Instead, to- 
gether with the Fornax dSph galaxy, we studied the ranked- 
eigenvalues cumulative percentages of a-elements (Mg, Si, 
Ca, Ti) and all elements measured in common: (1) for the 
LMC and the solar neighbourhood- Sc, Mg, Si, Ca, Ti, V, 
Cr, Co, Ni, Y, Zr, and Ba; (2) for Fornax and the solar 
neighbourhood- Mg, Si, Ca, Ti, Cr, Ni, Y, Ba, Nd, and Eu. 
The results are plotted in Fig. [19] and Fig. [20] Both plots 
show that dwarf galaxies have slightly more independent C- 
space dimensions than the solar neighbourhood, even though 
we adopt a larger measurement uncertainty for the dwarf 
galaxies. This is consistent with our previous discussion and 
results. However, to draw a firmer conclusion, dwarf galaxy 
samples with measurement uncertainty smaller than in the 
current samples are urgently needed. 



5.5 Globular clusters 

Now we study the Galactic globular clusters, which are 
all old objects and cover a large range in Galacto- 
centric radius. The formation process of the globular 
clusters remains poorly un derstood (for a review, see 
iCratton. Sneden fc Carrettal li2004). and we are interested to 
know what can be l e arned from their C-space properties. We 
use the jPritzl et~aLl (120051 ) compilation which gives the mean 
abundances for each globular cluster. 

We compared n-capture elements (Y, Ba, La, and Eu) 
for globular cluste rs with solar neighbourhood stars from 
iBurris et al.l (|2000l ). and compared a-elements (Mg, Si, Ca, 
and Ti) and then all elements in common (Mg, Si, Ca, Ti, Y, 
Ba, and Eu) for globular clusters with solar neighbourhood 
stars from Fulbright (2000, 2002), restricting all samples to 
the same metallicity range: —2.5 < [Fe/H] < —1.0. 

The ranked-eigenvalues cumulative percentages for the 
n-capture elements are plotted in the upper panel of Fig. 1211 
At first sight, globular clusters {black solid line) seem to have 
more C-space dimensions than the solar neighbourhood {red 
solid line). As we have cautioned above, however, the sample 
of globular clusters is not homogeneous but rather a com- 
pilation from different authors. Therefore, we would expect 
more systematic uncertainty for the globular clusters sam- 
ple. We estimated the intrinsic correlation with the same 
method used in the open clusters and dwarf galaxies study, 
and the results is over-plotted in the lower panel of Fig. 1211 
The results show that if we assume a 0.1 dex measurement 
uncertainty for solar neighbourhood stars and 0.2 dex mea- 
surement uncertainty for globular clusters, the independent 
dimensions of the n-capture element space are about the 
same. On the other hand, when we examined the dimen- 
sionality for the a-element subspace and the whole set of 
available elements in common to globular cluster and solar 
neighbourhood samples, the dimensionality of the globular 
cluster space seems to be larger than for the solar neighbour- 
hood, as shown in the middle and lower panel of Fig. 1211 

The sample of globular clusters is small (33 clusters in 
the metallicity range —2.5 < [Fe/H] < —1) and the impact 
of measurement uncertainty is significant. This can be seen 
in Fig. [21] in which the estimated intrinsic correlation for 
globular clusters shows a significant decrease of their inde- 
pendent C-space dimensions. This illustrates that the sample 
size is crucial to determine a robust C-space dimensionality. 
We also note that our study is somewhat limited in its scope: 
for example, we recall th e Na/ O anticorrelation which is seen 
in globular cluster stars (jCarretta et al.|[2009l : lGratton et al.l 
2010) but not among the field stars, and note that Na and 
O are not part of this study. 

In summary, within the limitations of the comparison, 
the globular clusters appear to have slightly larger chemical 
space dimensionality than the solar neighbourhood stars in 
the same metallicity range. It is difficult to know whether 
this result should be expected, because we do not yet have a 
good perspective on what globular clusters are and how (and 
where) their chemical evolution took place. A larger and 
homogeneous abundances survey covering more elements of 
globular clusters would be very desirable. 
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Figure 21. The ranked-eigenvalues cumulative percentages for 
the globular clusters {black lines) and the solar neighbourhood 
stars {red lines) in the same metallicity range: n-capture elements 
{upper panel), a-elements {middle panel) and all elements in com- 
mon {lower panel). The solid lines show the observed correlation. 
The dashed/broken lines show the cumulative percentages de- 
rived from the estimated intrinsic correlation matrix, assuming 
measurement uncertainty of 0.1, 0.2 dex respectively for globular 
clusters, and 0.05, 0.1 dex respectively for the solar neighbour- 
hood stars. 



6 DISCUSSION 

6.1 The n-capture elements subspace 

6.1.1 The r -process contribution 

Severa l au thors including iHonda et al.l (|2004al ) ; llvans et al.l 
(120061 ) and lHavek et~ai] (|2009l ) noted that the r-process pro- 
duction ratio for the n-capture elements in [X/Fe] is almost 
constant despite the large scatter in the abundance of each 
element. Therefore, in the C-space, if the major contribution 
is from the r-process, one would expect the data cloud to 
form a tight straight line in the n-capture element space. 
It is pleasing to see that, at low metallic ity where the r- 
process should be dominant (|Truranlll98ll ). the PC A shows 
a strong first component made up of all elements positively 
correlated. This r-process component contributed > 80% of 
the data cloud variance for the metal-poor stars. This agrees 
with other conclusions from the literature. 

6.1.2 The overabundance of light s-process elements 

Observations have also reported more scatter in the ratio 
of light s-p rocess elements to heavy s-process elements such 
as [Y /Ba] jBurris et al.ll2000l:lNorris et al.ll200ll : llvans et all 



I2OO3I : lAoki et al.ll2005l : lHondaet al.ll2007l ) at [Fe/H] < -3. 
As shown in Fig. [71 the light s-process elements seem 
to be overproduced at lower metallicity relative to the 
heavier elements. Various mechanisms have been proposed 
to e xplain this observa tion including the weak r-process 
(e.g. Ilzutani et al.|[2009l ) and the CPR l|Oian fc Wasserburd 
l2007t ). Our PC A method clearly show the existence of this 
unknown source. For the low metallicity sample (Fig.[6|, the 
second principal component is made up of light s-process el- 
ements anticorrelated with heavier n-capture elements. This 
suggests a second mechanism which either overproduces 
light s-process elements or overproduces heavier n-capture 
elements. From Fig.[7l the former interpretation is preferred. 

Also, as shown in Table [2l the second primary source 
for light s-process elements contributes more as the metal- 
licity decreases. In Fig. [5l the First Stars Survey sample 
shows slightly more dimensions than the Barklem's sam- 
ple. This could be interpreted as a larger contribution from 
the LEPP since the First Stars Survey sample has a lower 
mean metallicity. This decreasing trend of LEPP with in- 
creasing metallicity suggests that the culprit for LEPP is 
restricted to low metallicity and/or massive progenitors. If 
this is due to the mass dependence, this is consistent with 
the proposed LEPP candidates- the weak r-process, CPR, 
coUapsars or the truncated r-process. The relative contribu- 
tion of the LEPP becomes smaller at higher metallicity and 
the production of n-capture elements is dominated by the 
main r-process and the s-process. The results in Table[2]also 
illustrates one of the strengths of the PCA method compared 
with the traditional [X/Fe] -[Fe/H] plots- one can quantify 
the contributions of different mechanisms. 



6.1.3 Low-mass AGB contribution 

At high metallicity, one would expect a contribution from 
low-mass AGB stars with initial mass range of 1.5 - 3 Mq. 
Unlike LEPP, the s-process in low-mass AGB stars produces 
both light s-process elements and heavy s-process elements 
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such as Ba (see lHerwi3l2005l : lKappeler et al.ll201ll 'l. but not 
so much of the mostly r-process elements such as Nd and 
Eu r elative to the r-process contribution (jArlandini et alJ 
Il999l ). 

In our PCA analysis, we find that the relative contribu- 
tion from the first component (i.e. the r-process) decreases 
at higher metallicity (Fig. [5] and Fig. I12|l . We also find that 
the second component is made up of both light and heavy 
s-process elements (Fig. Both are consistent with the 
s-process in low-mass AGB stars, which is dominant at high 
metallicity. 

To show that this transition between high metallicity 
and low metallicity is real, we have als o analyzed the inter - 
mediate metallicity sample taken from iBurris et all (|200d ). 
We separate the the sample into two extreme metallicity bin: 
-2.7 < [Fe/H] < -2 and -1.5 < [Fe/H] < -1. The former 
shows results that resembles Fig. [6] and the latter shows re- 
sults that is similar to Fig. 1111 This further confirms that 
the transition is indeed happening and occurs around — 2 < 
[Fe/H] < -1.5. 

6.2 Satellite galaxies 



In Section 15.41 we presented results for the dwarf galax- 
ies samples in comparison with the solar neighbourhood 
stars in the same metallicity range. Recall that in this 
metallicity range, many stars in the solar neighbourhood 
are likely to be thick disk stars, and the belief is that 
the thick disk had a brief and in tense star formation his- 
tory (SFH) (see also Chiappini. M atteucci fc Romano 200l] : 
iKobavashi et al.ll2006l l. On the other ha nd, the SFH is be - 
lieved to be slower for dwarf galaxies (e.g. lWeisz et al.ll201lh . 

In our study, our PCA method results are consistent 
with the speculation that dwarf galaxies have slower SFH. 
We showed that dwarf galaxies seem to have more inde- 
pendent C-space dimensions than the solar neighbourhood. 
This implies that the ISM in dwarf galaxies is impacted by 
a higher level of stochastic enrichment: the less vigorous his- 
tory of enrichment events in the dwarf galaxies may allow 
more chemical traces to persist. 

Now we will discuss the principal components for n- 
capture elements. Our PCA analysis shows two peculiarities 
of the n-capture elements in the dwarf galaxies. Firstly, the 
second principal component gives a significant contribution 
to the data cloud variance. We interpret the second principal 
component as the LEPP contribution (because it has a large 
contribution from light s-process elements). In contrast, for 
the solar neighbourhood stars in the same metallicity range, 
the LEPP contribution is hardly visible and only contribu- 
tions from the r-process and the s-process in low-mass AGB 
stars are seen. This is consistent with the view that the SFH 
in dwarf galaxies is slower and therefore retains more chem- 
ical substructure from the previous generation of ejecta. 

Secondly, the third principal component in this analysis 
was not seen before. It has heavy s-process elements (Ba, La) 
anticorrelated with the other n-capture elements, includ- 
ing the light s-process elements. Recall that previously at 
high metallicity, for n-capture elements in the Milky Way's 
solar neighbourhood, the s-process in low-mass AGB stars 
tends to produce both light and heavy s-process elements. 
If the third component represents the AGB contribution, 
one would expect it to have the same sign for the light and 




[Fe/H] 



Figure 22. The [/is/Fe]-[Fe/H] of the solar neighbourhood, 
Fornax and the LMC stars. The symbo ls in grey represent 
the sol ar neighbou r hood compilation from lBurris et a L||2000|) 
(plus) iFulbriehtl | |2000| . [2002) C crosses) and iReddv et al.l 
feool |2006£ (open circles). T he blue filled circles are Fornax 
stars from lbetarte et al.l ll201 Cll). The re d filled triangles are the 
LMC stars from iPompdaetah ] 1I2OO8I '). The two dwarf galax- 
ies are overabundant in heavy s-process elements, compared 
to solar neighbourhood stars. 



heavy s-process elements. However, this is not the case for 
the dwarf galaxies. This result is not totally unexpected. As 
shown in Fig.[22]and Fig. 1231 observations for dwarf galaxies 
have shown over- abundances of [hs/Fe] at high metallicity 
but similar abundances or even under-abundances for the 
[Is/Fe]. 

This could be explained by the argument that the slower 
SFH in these galaxies allows low-mass AGB stars to con- 
tribute at lower metallicity: AGB stars working in a metal- 
poor environment have higher ratio of neutrons to Fe-peak 
seeds and will therefore preferentially produce heavier s- 
process elements (cf. Section [2]). 

If this is the case, we should see some similarities be- 
tween solar neighbourhood CEMP-s stars and dwarf galax- 
ies. It has been proposed that CEMP-s stars have suffered 
from AGB binary mass transfer at low metallicity. AGB 
stars which operate at low metallicity should, by the same 
argument, preferentially produce hs than Is. As CEMP-s 
stars are rare objects, we use compilation from the SAGA 
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Figure 24. The Y (Is) to Ba (hs) ratios as a function of [Fe/H] for 
solar neighbourhood not-CEMP stars, CEMP-s stars and dwarf 
galaxies stars. The solar neighbourhood not-CEMP stars symbols 
are the same as in Fig. [7] CEMP-s stars are plotted with black 
open diamonds. Fornax stars are plotted with black crosses and 
the LMC stars are plotted with black open squares. 



consistent with the PCA analysis results showing that LEPP 
contribution is still visible at the Fornax metallicity range 
(-1.2 < [Fe/H] < -0.8), as shown in Fig.[l8l because there 
are also less r-process events to homogenize the ejecta. 



[Fe/H] 

Figure 23. The [k/Fe]-[Fe/H] of the solar neighbourhood, 
Fornax and the LMC stars. The symbols are the same as in 
Fig. 1221 The two dwarf galaxies have similar or lower abun- 
dances of light s-process elements compared to solar neigh- 
bourhood stars. 



database (|Suda et al.ll2008l ). a nd select stars with [C/Fe ] > 1 
and [Ba/Fe] > 1 f criteria from lBeers fc Christliebllioosl 'l. For 
those stars with multiple measurements, we take the mea- 
surement from the study with the high est resolution an d 
signal-to-noise ratio (for more details, see ISudaet al.ll2oTll ). 
In Fig. 1241 we show the [Y/Ba] for the solar neighbourhood 
not-CEMP stars, CEMP-s stars and dwarf galaxies stars. 
This figure is consistent with our speculation. Both solar 
neighbourhood CEMP-s stars and dwarf galaxies stars show 
lower [Y/Ba] than the solar neighbourhood not-CEMP stars. 
This suggests that n-capture elements for both CEMP-s 
stars and dwarf galaxies could have been produced by the 
s-process in AGB stars operating in a metal-poor environ- 
ment. 

Alternatively, the 'normal' abundances of light s- 
process element for dwarf galaxies can be explained if dwarf 
galajcies do not form many massive stars during their forma- 
tion. This leads to a smaller contribution from LEPP and 
smaller production of light s-process elements in the forma- 
tion phase of these galaxies. The under-production of light 
s-process elements from LEPP is then compensated by the 
onset of low-mass AGB star at low metallicities. The smaller 
LEPP contribution at low metallicity is not necessarily in- 



6.3 All elements 

6.3.1 Low metallicity 

The interpretation of the PCA analysis of all elements at 
low metallicity as shown in Fig. |S] can be summarized as 
follows: 

The first principal component could be the contribu- 
tion of the r-process since it involves substantial production 
of all of our n-capture elements. There is also substantial 
Q-element production together with the r-process produc- 
tion. This supports the view that the r-process occurs in 
core-collapse supernovae rather than neutron star mergers. 
It is generally agreed that the a-elements at low metallicity 
are produced in core-collapse supernovae. If this interpreta- 
tion is correct, among a-elements, the composition of the 
first principal component as shown in Fig. [8] suggests that 
core-collapse supernovae that involve the r-process produce 
more heavier a-elements (Ca and Ti) than light a-elements 
(Mg). In the yields from iKobavashi et al.l (|2006l ). heavier a- 
elements are produced more in massive progenitors and/or 
with higher explosion energy. Our PCA results may suggest 
that the r-process is associated with more massive progeni- 
tors and/or high explosion energy. 

The second and third principal components should be 
discussed together. The second component shows anticorre- 
lation between a-elements and Fe-peak elements whereas the 
third component shows the production of both a-elements 
and Fe-peak elements. We suggest that this could be ten- 
tatively explained by th e hype r novae-supernovae mode l 
(|Umeda fc Nomotol |2002| . l2005l : IKobavashi eFal] |2006| ). 
The 'normal' supernovae produce more a-elements than 
iron (see also IWooslev fc Weaver! 1 19951 : IChieffi fc LimongH 
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|2002| ). However, the amounts are con strained from Galac- 
tic chemical evolution models (e.g. iTimmes et al.l 1 19951 : 
iKobavashi et al.] l2006l ) . Theoretically, since the formation 
of neutron stars or blackholes is uncertain, the remnant 
mass is determined by a mass-cut. For low-mass supernovae 
(< 20 Mq), the ejected Fe mass is constrained to be about 
O.O7M0 from the light-curves and spectra of individual su- 
pernovae. For massive supernovae (> 20 Mq), the observa- 
tional data is limited, but the significant production of Fe 
(together with a-elements) is supported by the light-curve 
and spectra. With higher explosion energy (> 10 x 10^^ erg), 
it is possible to produce enough amounts of heavy Fe-peak 
elemen ts including Zn and Co to fit the observed abundances 
ratios (jKobavashi et al1l2006t ). Our PCA analysis seems to 
well-match this scenario: the third component could be in- 
terpreted as the contribution of hypernovae and the second 
component could be interpreted as the contribution from 
'normal' supernovae. 

We examine this idea by comparing the First Stars Sur- 
vey sample which has a lower mean metallicity with the 
Barklem's sample. Since hypernovae are preferentially pro- 
duced by more massive progenitors and there fore slightly 
shorter life span. IKobavashi fc Nakasatd (|2011al ) introduced 
metallicity dependence of the hypernovae fraction and ar- 
gued that the contribution of hypernovae is larger at lower 
metallicity. However, the fraction of 'normal' supernovae and 
hypernovae is uncertain. To examine these ideas, we analyze 
the C-subspace of Mg, Ca, Ti, Co, Ni, Zn. In this subspace, 
we interpret the component that has positive contribution 
from all elements to represent the hypernovae contribution 
and the component that shows anticorrelation between the 
Q-elements (Mg, Ca, Ti) and heavy Fe-peak elements (Co, 
Ni, Zn) to be the usual supernovae contribution. With these 
assumptions, we found that for the First Stars Survey sam- 
ple, the hypernovae component makes up 58.5% of the two 
contributions, whereas for the higher-metallicity Barklem's 
sample, the ratio decreases to 52.8%. 

The a-elements contribution appears in all of the first 
three components. However the second and third compo- 
nents do not show a visible contribution from all n-capture 
elements. This might imply that, although r-process pro- 
duction is accompanied by a-elements yield, the reverse is 
not always true. If the r-process occurs in core collapse su- 
pernovae, then perhaps it can be triggered only if certain 
conditions or progenitor mass range are satisfied, as sug- 
gested by previous work (jWanaio fc Ishimaru. 2006 1. 

The Al abundance was corrected by -1-0.6 dex for NLTE 
effects. If this correction is not made, the results remains 
largely unaltered, except that Al is contributing to the sec- 
ond component. Al is an light odd-Z element which comes 
from ^■^Ne, and thus the yields de pends on the metallic- 
ity of progen itor stars (e.g Kobavas hi et al.ll2006h : see also 
lArnettI l|l97ll ). Fig. [SS] shows that, relative to light even-Z el- 
ements such as Mg, light odd-Z elements such as Na and Al 
are indeed metallicity-dependent and are suppressed at low 
metallicity. This shows that the production of light odd-Z 
elements is indeed metallicity-dependent and is suppressed 
at low metallcity. Therefore, although our NLTE-correction 
for Al is crude and does not depend on the stellar parame- 
ters, the correction is in the correct direction and is required 
to give a sensible PCA result- i.e. Al should not be con- 
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Figure 25. These plot show the ratio of light odd-Z elements 
(Al and Na) to light even-Z elements/o-elements (Mg) as a 
function of [Fe/H] of solar neighbourhood halo and disk stars. 
The symbols are the same as in Fig. [7] Al, Na and Mg abun- 
dances for First S tars Survey arc NLTE-corrccted according to 
lAndrievskv et al . ( 2007, 2008, 2010) in these two plots. [Al/Fe] 
for Barklem's sample are NLTE-corrected by -1-0.6 dex. Both 
plots show an odd-even effect, i.e. light odd-Z elements are 
suppressed at low metallicity. 



tributing to the core-collapse supernovae component at low 
metallicity. 

The fourth Cr-Mn component could be interpreted as 
the contribution from incomplete Si-burning, favoring the 
idea that the production of Cr an d Mn relies on a differ - 
ent mass cut region as proposed bv lNakamura et al.l l)l999t ). 
If this interpretation is correct, this might suggest that the 
production ratio of elements synthesized in the incomplete 
Si-burning region are strongly correlated among themselves, 
but their production ratios are decoupled from the produc- 
tion of other elements such as Zn, Co, and Ni that are syn- 
thesized in the complete Si-burning region. This component 
should however be viewed with caution. Only Cr I is detected 
in Barklem's sample, and both Cri a nd Mn are suspected to 
require significant NLTE correction (|Bergemann fc GehrenI 
I2OO8I : iBergemann fc Cescuttill2010bl '). Therefore this dimen- 
sion could be spurious. Further discussion of this compo- 
nent requires the availability of a large homogeneous NLTE- 
corrected data set. 



22 Y. S. Ting, K. C. Freeman, et al. 



6.3.2 High metallicity 

Stars in this metallicity range have ages up to about 10 Gyrs, 
corresponding to more than a thousa nd generations of core- 
coUapso supernovae (refer to fig. 8 in lKoba vashi fc Nomotol 
[2009). The PGA results for the high metallicity sample 
shown in Fig. [TS] can be summarized as follows: 

The first component is a strong component (40% of the 
total variance) which consists of all light odd-Z elements, 
a-elements and Fe-peak elements. This could be interpreted 
as the loss of chemical dimensions after many generations 
of short-time-scale cor e -collap se su pernovae as shown in 
iKarlsson fc GustafssonI (|200ll ) and iBland-Hawthorn et al.l 



I 2010t ). After many generations of enrichment, elements that 
can be formed by short-time-scale core-collapse supernovae 
would have been well mixed in the ISM. In short, the first 
component is just a measure of the metal-richness of the gas 
reservoirs from which the stars formed. Much information 
was lost after the first few generations of supernovae. 

It is also interesting that Al contributes to the first com- 
ponent. This is different from the low metallicity sample for 
which the Al does not contribute. This is not unexpected. As 
discussed, Al is expected to be suppressed at low metallicity, 
but starts to contribute as metallicity increases. 

The second component can be interpreted as an r- 
process contribution, since all n-capture elements contribute 
to this component in the same sign. Unlike the low- 
metallicity case, a-elements do not seem to contribute to 
this component. As discussed above, this may be due to the 
small parameter space of core-collapse supernovae that in- 
volve the r-process. As massive stars become fewer at high 
metallicity, most of the core-collapse supernovae will prefer- 
entially only produce a-elements but not r-process elements. 
These a-elements, as they continue to be produced and ho- 
mogenized within the ISM, will only contribute to the hy- 
perplane orthogonal to the n-capture elements. If this is the 
case, the PGA results will show a strong first component 
that includes supernovae ejecta and leaves the occasionally- 
produced n-captured elements to form a second component. 
This is also consistent with the results that the s-process 
in low-mass AGB stars starts to dominate n-capture el- 
ements production at high metallicity. Alternatively, the 
PGA results at high metallicity may imply that there is 
another r-process site that doe s not produce a-elements , 
such as neutron star mergers (e.g. lFreiburghaus et al.|[l999bl : 
iRosswog et"allll999l ') 

The third component includes the contribution from 
light and heavy s-process elements, and as discussed in the 
n-capture elements section, could be interpreted as the low- 
mass AGB contribution. The appearance of a-elements con- 
stribution is unclear, but could be due to the envelope mass 
loss releasing the natal gas with heavier elements during the 
AGB phase. 



6.4 Wider region of survey 

As shown in Fig. 1161 among the 12 elements that we investi- 
gated, compared to solar neighbourhood disk stars, we find 
about 1 more dimension for open clusters which were sam- 
pled in a larger Galactic volume. The extra dimension for a 
larger Galactic volume is not surprising. Although stars from 
the solar neighbourhood could come from different regions 



of the galaxies via radial mixing, stars from open clusters 
far from the solar radius are unlikely to have undergone the 
same mixing processes as the solar neighbourhood stars. The 
larger Galactic volume (6 < re 20 kpc for open clusters, 
compared to 7.5 < re < 8.5 kpc for the solar neighbour- 
hood) is likely to retain more variation in C-space. 



6.5 K and Cu; APOGEE; the Ca-triplet region 

In this section, we briefly discuss the effects of adding K 
and Cu to the list of elements, the dimensionality of the 
APOGEE C-space, and the dimensionality of the C-space 
accessible from observations of the Ca-triplet region of the 
spectrum. 

We did not study the contribution from K and Cu, 
because these elements are absent from most of our sam- 
ples except for Reddy's high metallicity sample. The HER- 
MES bands are designed to include these two elements. We 
found that adding K and Cu to the high metallicity sample 
only slightly increases the effective dimensionality- from 6 
(for 17 elements) to about 6.5 (for 19 elements)- with K 
and Cu contributing equally. However, we suspect that K 
and Gu may add more dimensions at low metallicity be- 
cause we can still distinguish contributions from different 
types of core-collapse supernovae. Gu is an interesting el- 
ement; it is an odd Z elements pr eferably produced with 
hypernovae l|Kobavashi et all I2OO6I ) and may have signifi- 
cant contribution fro m the weak s-process (like Ga & Ge, 
IPignatari et al.ll2010 ''). K may be affected by aspherical ex- 
plosions (IMaeda fc N omoto 2003) and also by the neutrino 
luminosity i Kobavashi et al.ll2011dl ). The neutrino luminos- 
ity and weak s-process contribution could add promising 
extra dimensions at low metallicity. 

In this study, we focused on the HERMES C-spa,ce. The 
APOGEE high resolution near-infrared survey will measure 
the elements G, N, O, Na, Al, K, S, Mg, Si, Ca, Ti, V, Cr, 
Mn, Go, Ni but not n-capture elements, and we now esti- 
mate the dimensionality of this C-space. We exclude C, N, 
O, Na as before because they may be subject to internal 
mixing. First, we considered the 9-dimensional C-space (Al, 
Mg, Ga, Ti, V, Cr, Mn, Go, Ni) measured by both Barklem's 
low metallicity sample and Reddy's high metallicity sample. 
We found that the low metallicity sample has about 4.5 in- 
dependent dimensions, whereas the high metallicity sample 
has 4 independent dimensions. This is consistent with our 
previous observation that n-capture elements contribute 2 
independent dimensions at high metallicity- from r-process 
and s-process in low-mass AGB stars, but at low metallicity, 
n-capture elements only contribute 1 - 1.5 dimensions. 

In addition to these 9 dimensions, Reddy's sample also 
included Si, S and K. We found that adding Si alone does 
not add to the effective dimensionality, as Si is strongly cor- 
related with the other a-elements. However, we found that 
including S and K will add around 1 additional dimension to 
the C-space. Finally we summarize the effective dimensions 
of the C-space (and subspaces) of various environments in 
Table El 

The Ca-triplet region is widely used for large-scale spec- 
troscopic surveys, so we also estimated the dimensionality 
of the C-space defined by this region of the spectrum. For 
the metal-rich stars, the elements Al, Mg, Si, Ca, Ti and 
Ni define a space with 2 independent dimensions. For the 
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Table 3. Summary of effective dimensions in various environments and C-subspace. 



Environment Elements considered Eff. dims. No. of elements 





Neutron- Capture Elements 








Solar neighbourhood halo stars 


Y,Zr,Ba,La,Nd,Eu 


1 - 


- 2 


6 


Solar neighbourhood disk stars 


Y,Zr,Ba,La,Nd,Eu 


2 




6 


Fornax dSph galaxy 


Y,Ba,La,Nd,Eu 


3 




5 




All Elements (HERMES) 








Solar neighbourhood halo stars 


Al,Sc,Mg,Ca,Ti,V,Cr,Mn,Co,Ni,Zn,Y,Zr,Ba,La,Nd,Eu 


6 




17 


Solar neighbourhood disk stars 


Al,Sc,Mg,Si,Ca,Ti,V,Cr,Mn,Co,Ni,Zn,Y,Zr,Ba,Nd,Eu 


6 




17 


Solar neighbourhood disk stars 


Al,K,Sc,Mg,Si,Ca,Ti,V,Cr,Mn,Co,Ni,Cu,Zn,Y,Zr,Ba,Nd,Eu 6 - 


- 7 


19 


MW open clusters 


Al,Sc,Mg,Si,Ca,Ti,V,Cr,Co,Ni,Y,Ba,Nd 


6 - 


- 7 


13 


MW globular clusters 


Mg,Si,Ca,Ti,Y,Ba,Eu 


4 




7 


Fornax dSph galaxy 


Mg,Si,Ca,Ti,Cr,Ni,Y,Ba,Nd,Eu 


6 




10 


The Large Magellanic Cloud 


Sc,Mg,Si,Ca,Ti,V,Cr,Co,Ni,Y,Zr,Ba 


6 - 


- 7 


12 




All Elements (APOGEE) 








Solar neighbourhood halo stars 


Al,Mg,Ca,Ti,V,Cr,Mn,Co,Ni 


4 - 


- 5 


9 


Solar neighbourhood disk stars 


Al,Mg,Ca,Ti,V,Cr,Mn,Co,Ni 


4 




9 


Solar neighbourhood disk stars 


Al,K,S,Mg,Si,Ca,Ti,V,Cr,Mn,Co,Ni 


5 




12 



* We assume measurement uncertainty of 0.1 dex and adopt 85% cut-off for the ranked-eigenvalues cumulative percentages to derive the 
effective dimensions. This might not hold for open/globular clusters and dwarf galaxies as they have higher measurement/systematic 
uncertainty. Therefore the dimensions for open/globular clusters and dwarf galaxies in this table should be regarded as upper limit 



metal-poor stars, the elements Al, Mg, Ca, Ti and Ni de- 
fine a space with 3 independent dimensions. Note that this 
estimate excludes possible extra dimensions from the light 
elements C, N and O. 



7 CONCLUSION 

In this paper, we used Principal Component Analysis to 
study and interpret stellar element abundances. We dis- 
cussed a way to deal with the non-semi-definite positivity 
of the correlation matrix due to the incompleteness of the 
data. 

We illustrated the power of this method by confirm- 
ing the tight yield ratio of n-capture elements from the r- 
process, which confirms the universality of the r-process. We 
also traced the over-abundances of light s-process elements 
at low metallicity, and found that the relative contribution 
of the mechanism causing this over-abundances is decreas- 
ing with increasing metallicity. The site of LEPP is likely to 
be associated with massive stars, which is consistent with 
weak r-process and charged particle reactions. The method 
is also able to trace the s-process in low-mass AGB stars 
at high metallicity. The transition can be clearly shown us- 
ing data with intermediate metallicity and occurs at —2 < 
[Fe/H] < -1.5. 

Our analysis also suggests that r-process production 
sites are accompanied by a-element production, favouring 
core-collapse supernovae as the r-process site. This can be 
seen at low metallicity but is no longer visible at high metal- 
licity, which may be due to the small parameter space of the 
r-process site, or the inclusion of another site such as neu- 
tron star mergers that does not produce a-elements. 

The analysis indicates two types of core-collapse super- 
novae: one produces mainly Q-elements, the other produces 
both Qf-elements and Fe-peak elements with a large enhance- 



ment of heavy Fe-peak elements. This is consistent with hy- 
pernovae. We find that the contribution of hypernovae is 
larger at lower metallicity, which may be important to un- 
derstand the physics of hypernovae. We also discussed the 
Cr and Mn production from the incomplete Si burning re- 
gion, and the metallicity dependence of odd-Z element (Al 
in our analysis), but these may be affected by NLTE effect. 

To estimate the dimensionality of the C-space available 
to HERMES, we chose to work in [X/Fe] space, because all 
of the elements are highly correlated with [Fe/H]. We based 
our estimate of the dimensionality of [X/Fe] space on the 
simulations described in Section 14.1.31 indicating that the 
eigenvectors contributing to the first 85% of the cumulative 
sum of the ranked eigenvectors in [X/Fe] space are probably 
real. Due to the paucity of data and the potential contri- 
bution from internal mixing, we did not include Li, C, N, 
O and Na in our study. The PCA analysis of 17 elements 
shows that the chemical [X/Fe] space has about 6 dimen- 
sions both at high metallicity and low metallicity in the 
solar neighbourhood. We expect to have about 1 to 2 fur- 
ther independent dimensions of the [X/Fe] space from these 
elements that we excluded. Adding K and Cu for the HER- 
MES sample provides about another half of a dimension for 
the [X/Fe] space. Our ultimate goal here is to evaluate the 
dimensionality of the 25-element HERMES space. Including 
the further dimension from [Fe/H] itself would give about 
8 to 9 independent dimensions for the HERMES chemical 
[X/Fe] + [Fe/H] space. 

Although the number of C-space dimensions is similar at 
high and low metallicity, the interpretations of the principal 
components are very different in these two cases. For ex- 
ample, at high metallicity, we have an extra 'birth- imprint' 
from low-mass AGB stars. This extra contribution compen- 
sates the dimension loss due to homogenization of the core- 
collapse supernovae ejecta. 

Our analysis indicates that dwarf galaxies retain more 
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chemical inhomogeneity than the Milky Way disk. The anal- 
ysis also suggests that the s-process in AGB stars of dwarf 
galaxies preferentially produce heavier n-capture elements 
which is similar to the solar neighbourhood CEMP-s stars. 
Both of these effects may be produced by AGB stars work- 
ing in metal-poor environment. These findings are consistent 
with the view that dwarf galaxies have had a slower SFH 
than the Milky Way disk. 

The C-space for Galactic globular clusters and open 
clusters, which span a large Galactic volume, appear to de- 
fine a slightly higher-dimensional C space than that for the 
solar neighbourhood. A large homogeneous sample of abun- 
dances in globular clusters and open clusters is highly desir- 
able to confirm this conclusion. 

We showed that PCA can shed some insight on the 
underlying nucleosynthesis mechanisms. At low metallicity 
where the chemical space is less homogenized, this method 
could be exploited to put constraints on the number of 
mechanisms or, more precisely, on the number of param- 
eters that govern Galactic chemical evolution in different 
environments. 

For our future observational work in chemical tagging, 
the PC A method could combine with group-finding a lgo- 
rithms (|Agrawal et all 1 19981 : |Sh arma fc JohnstonI |2009|) to 
increase the efflciency of the algorithms for finding substruc- 
ture in chemical space. The group-finding algorithms are 
sensitive to redundant dimensions which decrease the den- 
sity of the clusters in chemical space. We expect that our 
PCA method can reduce the effective dimensionality of the 
chemical space without compromising any useful informa- 
tion. 
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APPENDIX A: 
ANALYSIS 



PRINCIPAL COMPONENT 



Consider an n dimensional space with random variables 
{Xi, . . . , Xn). In this section, we will re-derive and show 
that the normalized eigenvector corresponding to the largest 
eigenvalue of the correlation matrix is the direction where 
the standardized (i.e. normalized and mean-shifted) data 
cloud has the largest variance. Suppose we have m data 
points such that the random variables value of the q-th 
data point is {Xi^q, . . . ,X„,q). Let Xq, aq be the mean 
and the standard deviation of the random variable Xq re- 
spectively, and B be the matrix such that the p-th col- 
umn corresponds to the standardized p-th data point, i.e. 

-Xi X„.p-X„^ gj^^^ ^^^^ Q _ DOT 



the correlation matrix. 

Our goal is to find the normalized vector u cen- 
tered at {Xi,...,Xn) such that the projection of the 
standardized data cloud on u has the maximum vari- 
ance. The p-th standardized data point projection on u is 
• u, so B"^ u is the ensemble of u- 



that the mean of B^u = 0, and therefore the variance of the 
standardized data cloud projected on u is (u^B)-(B'^K)/m. 
To summarize, our goal is to maximize this value under the 
constraint u^u — 1. Without lost of generality, we will max- 
imize (u^B) ■ (B^n) instead of (ix^B) ■ (B^ii)/m. 

We use Lagrange multiplier formalism to deal with the 
constraint by introducing Langrange multiplier ai. In this 
formalism, the goal is to maximize the expression as shown 
in Equation I All 



Li{u, Qfi) = (B'^u)^ — Qi(lX^ — 1) 



(Al) 



By using the necessary condition of the maximality, i.e. 
= = 0, one can show that u has to be a eigen- 

ou aai ' o 

vector of C and = 1. Recall that C is symmetric and 
therefore orthogonally diagonalizable. Let the diagonal ma- 
trix to be D = diag(Ai), i.e. u^Cu = u^P^DPu for some 
orthonormal matrix P. Since P is orthonormal and — 1, 
it is easy to check that (Pit)^ = 1. Let Ai be the largest 
eigenvalue. One has 



max n"^P"^DPn 



max V 

\\-v\\ = l 



Dv = Al 



(A2) 



This can be attained by choosing u to be the eigenvector 
of C corresponding to the eigenvalue Ai. And thus it is the 
sufficient condition. Similarly, to find the second principal 
component w such that it is orthogonal to the first principal 
component u (i.e with constraint ■ u = 0) and accounts 
for the largest part of the rest of the variance, it suffices to 
maximize the expression as shown in Equation [ 

L2{w,l3i,l32) = {B^wf 



I3i{w^ ^ 1) - P2{w^ ■ u) 



(A3) 



By similar calculation, one can show that w being the 
eigenvector corresponding to the second largest eigenvalue 
is necessary and sufficient to maximize this expression, and 
so forth for the subsequent eigenvectors. Graphically, we are 
looking for a orthogonal transformation of the random vari- 
ables space such that after the transformation, the first axis 
will account for the largest part of the total variance, and 
second axis is orthogonal to the first axis and account for 
the largest part of the rest of the variance. It is important 
to note that the variances that they account, are given by 
the eigenvalues of the correlation matrix as shown in Equa- 
tion |M1 



projection of each standardized data point. One can check 



APPENDIX B: INCOMPLETE DATA SET 

If a data set is incomplete, in principle we can still calculate 
the Pearson's correlation for any two random variables by 
using only the data points that have value for both random 
variables, and therefore we can construct the correlation ma- 
trix entry by entry. However the problem of this approach is 
obvious: since the correlation matrix C is not BB"^ as before, 
although it is still symmetric, it might not be semi-positive- 
definite, i.e. it might have undesirable negative eigenvalues. 
Our goal is to find a semi-posit ive-definite matrix t hat is 
close to the correlation matrix C. lRebonato fc Jackell (ll999D 
suggested the following: 

Let S to be the ensemble of eigenvectors of matrix C, 
i.e. C-S = A-S, where A = diag(Ai), and Ai the eigenvalues. 
If C is not semi-positive-definite, it has at least one negative 
eigenvalue. We define the positive diagonal matrix A' = 
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diag(A^): 

./ ,/ jAi ifAi^O /-^D1^ 

^ ■ ^' = l0 ifA.<0 (^1) 
and the diagonal 'scaling' matrix T = diag(fi): 

T : U=[J2^L^'^Y' (B2) 

m 

Let B' = 's/TSv'A', where the square root of a diagonal 
matrix is defined as the square root of each of its diagonal 
entry. Finally we define: C = B'B'"^. One would expect C to 
be quite close to C since C = S"^ AS^ and C = \/TSA'S\/T. 
The lost from A — > A' is compensated by the rescaling 
matrix T. There are better ways to optimize the search of 
C but they are mostly computational much more demand- 
ing than this method. In our case, this estimation is good 
enough since it gives reasonable small errors both in term 
of ei = Eij(C,, - C[,f and ea = Er=i(^- " ^0', where 
are eigenvalues of C. 



APPENDIX C: WEIGHTED TOTAL LEAST 
SQUARE 

This method is adopted from iKrvstek fc Ant"onl (120071 ). As 
discussed in Section [4]2] our goal is to minimize Equation [l] 
Instead of considering the best fit line y = ax -\- h us- 
ing variables a and 6, iKrvstek fc Anto"nl ( 2007l l suggested 
a change of variable — > i^ — ^,^) x R+, where a = tan(a) 

(a,b)i >(c«,p) 

and b = p/ cos(a). They showed that in this case, Equation[T] 
becomes 



71 — 2 ^ Ux^k Q= + cos'' a 



For our case, we assume Ux,k = Uy,k ~ <J, for all fc. Therefore 
we have a very neat expression: 

1 " 

X^(a,p) = — — y^iykcosa -Xk sm a- pf (C2) 

(j^(n — 2) 

We use Truncated-Newton Method (TNMIN.pro in IDL, 
written by Craig B. Markwardt) to search for the minimal 
point (ao,Po)- Furthermore, we can approximate the uncer- 
tainty of the parameters estimation using the inverse Hessian 
of x^- More explicitly: 



o-^(p) cov(p,q) a 2 ^^J" 
cov(a,p) cr2(a) ) \ x^p xL 



(C3) 



, 2 d X 2 d X 2 2 9 X 

Wnere Xpp = gpH i Xaa = "g^i Xap ~ Xpa = a^gp 



